Advanced Pattern Recognition Techniques

(RTO-EN-2)

Executive  Summary 

Pattern  recognition  has  its  roots  in  biological  evolution.  It  is  the  extraction  of  consistent  information 
from  noisy  spatiotemporal  data  and  is  currently  being  used  in  systems  for  battlefield  supervision,  smart 
weapons,  anti-counterfeiting  of  all  kinds,  and  for  the  reduction  of  false-alarm  rates  in  the  detection  of 
land  mines  and  unexploded  ordnance. 

Conventional  methods  of  pattern  recognition  are  mainly  based  on  statistical  approaches,  such  as  density 
estimates  or  discriminant  analysis.  In  this  sense  artificial  neural  networks  can  be  regarded  as  an 
extension  of  these  techniques.  Fuzzy  methods  originate  from  control  theory,  but  have  also  proven 
successful  in  pattern  recognition.  Over  time,  neuro-fuzzy  methods  have  emerged  that  try  to  combine  the 
advantages  of  each  method  and  minimize  the  drawbacks. 

An  important  task  in  pattern  recognition  is  to  choose  the  right  features.  Therefore  a  main  part  of  this 
Lecture  Series  was  devoted  to  feature  extraction.  This  can  be  achieved  in  several  ways:  by 
electromagnetic  and  acoustic  singularity  expansion  methods,  by  model  based  scattering  signatures,  also 
by  using  multiresolution  time  and  frequency  domain  analysis,  by  SAR/ISAR  imaging,  bistatic 
microwave  imaging  and  electromagnetic  inversion  techniques. 

Practical  applications  of  pattern  recognition  techniques  were  demonstrated  with  focus  on  statistical 
methods  and  artificial  neural  networks.  Real-time  software  for  discriminant  and  principal  component 
analysis  as  well  as  PC  based  accelerator  boards  with  on  chip  artificial  neurons  were  introduced. 
Different  methods  for  feature  extraction  with  examples  for  automatic  pattern  recognition  were  shown. 

The  material  in  this  publication  was  assembled  to  support  a  Lecture  Series  under  the  sponsorship  of  the 
Sensors  and  Electronics  Technology  Panel  and  the  Consultant  and  Exchange  Programme  of  RTO 
presented  on  14-15  September  1998  in  Bristol,  UK,  on  17-18  September  1998  in  Rome,  Italy,  and  on 
21-22  September  1998  in  Lisbon,  Portugal. 


Advanced Pattern Recognition Techniques

(RTO-EN-2)


Synthesis (translated from the French)

Pattern recognition has its origins in biological evolution and can be defined as the
extraction of coherent information from raw spatiotemporal data. It is used for battlefield
surveillance, in smart munitions, for anti-counterfeiting, and for reducing false-alarm rates
in the detection of land mines and unexploded ordnance.

Classical pattern recognition methods are based essentially on statistical approaches, such
as density estimation and discriminant analysis. From this point of view, neural networks can
be regarded as an extension of these techniques. Fuzzy methods derive from control theory,
but they have been used successfully for pattern recognition. Following these developments,
neuro-fuzzy methods have appeared which aim to combine the advantages of each method while
minimizing their disadvantages.

Choosing the appropriate features is one of the essential tasks of pattern recognition.
Consequently, one of the main sessions of this lecture series was devoted to feature
extraction. A number of techniques were examined, namely:

-  acoustic and electromagnetic singularity expansion methods (SEM)
-  modelling of scattering signatures
-  analysis in the time and frequency domains
-  SAR/ISAR imaging
-  bistatic microwave imaging and inverse methods

Applications of pattern recognition techniques were presented, with the emphasis on
statistical methods and artificial neural networks. Real-time software for discriminant
analysis and principal component analysis was demonstrated, as well as PC accelerator boards
with on-chip artificial neurons. Various feature extraction methods were presented with
examples relating to pattern recognition.

The texts contained in this publication supported Lecture Series 214, presented under the
sponsorship of the Sensors and Electronics Technology Panel within the framework of the RTO
Consultant and Exchange Programme on 14-15 September 1998 in Bristol, United Kingdom, on
17-18 September 1998 in Rome, Italy, and on 21-22 September 1998 in Lisbon, Portugal.
Approaches to Pattern Recognition

Hendrik  Rothe 

e-mail: rothe@unibw-hamburg.de
Tel.:  040/6541-2723  -  Fax:  040/6541-2743 
Mess-  und  Informationstechnik,  FB-MB,  Universitaet  der  Bundeswehr 
Holstenhofweg  85,  D-22043  Hamburg 


1  Introduction 

Pattern recognition per se covers a wide range of activities from many areas of
science, engineering and everyday life. It has a long and respectable history within
engineering directed at military applications. However, the cost of the hardware
needed to acquire the necessary data (images or sensor signals) restricted its broad
application for many years. Nowadays, almost anybody can design and test even
powerful automated pattern recognition systems.

Therefore there is an increasing need to understand the fundamentals of pattern
recognition techniques.

How could pattern recognition be defined? A good first approximation could be:

Given some examples of complex signals and their correct classification,
make correct decisions automatically for a stream of future examples.

The roots of pattern recognition can be found in biological evolution, since many
of us humans can, e.g.

•  spot changing weather
•  identify thousands of species (flowers, plants, animals)
•  recognize faces and voices

In science and technology, literally thousands of pattern recognition tasks have
emerged, such as:

•  diagnosing diseases
•  identifying types of vehicles and planes
•  identifying fingerprints and DNA profiles
•  recognizing handwritten characters and the human voice
•  picking optimal moves in certain situations (chess, war)
•  identifying incoming missiles from sensor signals
•  detecting land mines and unexploded ordnance

While humans are able to do many of these tasks quite well, the desire is to
construct machines that perform these tasks cheaper, better, faster and, of course,
automatically. Pattern recognition is the engineering discipline of building such
machines.

Because humans can perform many pattern recognition tasks very well, there has
been for many years an interchange of ideas between engineers in the pattern
recognition area and psychologists and physiologists doing research on human and
animal brains. In the late 1950s the result of this cooperation was the perceptron;
in the mid 1980s artificial neural networks emerged. Both approaches left their
biological roots and were studied by exact mathematical techniques with respect to
their engineering performance.

Human pattern recognition is mainly learnt. It is not possible to describe the rules
used to recognize a certain voice. On the other hand, biologists can give rules to
discriminate, e.g., between the African and the Indian elephant.

Imagine tasting a large number of unlabeled cups of tea. After some training we
will be able to detect a grouping in the teas. But we will need a teacher to tell us
that our favourite group is, e.g., Darjeeling GFOP, first flush.

The discovery of new groupings in sets of data is called unsupervised pattern
recognition. The more common method uses a training set, also called the
recognition set, with predefined groupings of data. This is called supervised
pattern recognition. After the system has learned, it classifies future data, the
prediction set, into one or more of the learned classes.

Pattern recognition should not be confused with machine learning, which was
fathered by the artificial intelligence and computer science communities. Machine
learning in general labels its classes only with true or false, while pattern
recognition uses many more classes. In general, it can be said that machine
learning mimics human reasoning by logical or binary operations, and background
knowledge may be exploited.

Now we can define the pattern recognition task in a more formal way. We assume a
given set $\mathcal{K}$ of well defined classes and a classifier C. For each example
of a class there is a certain number of features, or feature variables. These
features are fed into the classifier C. After a (hopefully) short time the
classifier responds with one of:

•  this example is from class $\mathcal{K}_i$
•  this example is from none of the classes
•  this example is too complicated for me

The second category contains all outliers, while the third reports all rejects or
doubts. The primary assessment of C will be its performance in terms of the
percentage of correct classifications. The other aspect is the power of
explanation. Users need to have confidence in the system before it will be
accepted. For example, no one really cares if an automatic Zip code reading machine
rejects an odd letter. But when a civilian airliner is classified as an enemy
aircraft and shot down, or an area is classified as mine free and explosions
happen, serious questions are raised. Therefore in many cases black-box classifiers
are totally unacceptable.
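
The three possible responses of a classifier (a class, "none of the classes", or "too complicated") can be illustrated with a deliberately simple sketch. It assumes a nearest-centroid rule in Python; the function name, the class labels, and the two thresholds `outlier_dist` and `reject_margin` are hypothetical and are not taken from this paper.

```python
import math

def classify(x, centroids, outlier_dist=3.0, reject_margin=0.2):
    """Return a class label, "outlier", or "reject" for feature vector x.

    centroids maps a class label to its mean feature vector (assumed model).
    """
    # Distance from x to every class centroid, smallest first.
    dists = sorted((math.dist(x, c), label) for label, c in centroids.items())
    best_d, best_label = dists[0]
    if best_d > outlier_dist:
        return "outlier"      # far from every known class
    if len(dists) > 1 and dists[1][0] - best_d < reject_margin:
        return "reject"       # two classes are almost equally close
    return best_label

centroids = {"K1": (0.0, 0.0), "K2": (4.0, 0.0)}
print(classify((0.5, 0.2), centroids))    # close to the K1 centroid
print(classify((2.0, 0.0), centroids))    # halfway between K1 and K2
print(classify((20.0, 20.0), centroids))  # far away from both centroids
```

Separating "outlier" from "reject" in the return value keeps the two failure modes visible to the user, which is one small step away from a pure black box.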


Very often, the most important aspect in the design of a classifier is to choose the
right features. If the wrong data are measured, the task may become very difficult
or even unachievable. In this sense, advances in pattern recognition techniques
have come from better features rather than from more complex classifiers.

2  Feature  Extraction 

As mentioned in the preceding section, feature extraction is very important for
correct pattern recognition. In principle, there are two main approaches, namely
induction and deduction.

Induction is, from the historic point of view, the older method. It was used before
the advent of modern science. The point is that certain hypotheses about the object
under study lead to the extraction of information that possibly describes the
object or process in a correct manner. Clearly, experience is of fundamental
importance in this case. However, there is one basic principle called Occam's
Razor. It was formulated by the English scientist William of Occam in the 14th
century. It says:

Non sunt multiplicanda entia praeter necessitatem.

With respect to pattern recognition we could translate this as follows:

Use as few feature variables as possible to provide consistent classification.

Occam's Razor is the starting point of modern statistical inference theory, which
provides the possibility to assess feature variables with respect to their specific
discriminatory power or information content. Examples of such methods are
Principal Component Analysis and Discriminant Analysis.

It  should  be  mentioned  here  that  Artificial  Neural 
Networks,  Fuzzy  Sets  and  Neuro-Fuzzy-Methods 
are  not  capable  of  assessing  feature  variables,  nor 
can  they  give  information  about  the  stability  of  the 
classification  process,  or  the  probability  of  correct 
decisions. 

Deduction is a method which is typical of modern science and engineering. It works
as follows:

•  Make a physical model of the object or process under study
•  Choose the applicable fundamental laws of nature
•  Make a mathematical model
•  Solve the equations of the mathematical model either analytically or numerically
•  Find the variables describing the interesting properties of the system; these
are the feature variables

Deduction is always more desirable than induction because it provides insight.
However, since many processes cannot be modeled today by deduction, peaceful
co-existence between the two approaches will last for a long time.

In this paper, only feature variables derived by induction will be used.


2.1 ... from Images

Features derived from images should be invariant with respect to shift, rotation
and scale. While shift invariance can be obtained by using Fourier transform
techniques, rotational invariance requires the transformation of the image into the
polar plane. Finally, scale invariance can be achieved by the Mellin transform.
Complete invariance implies the use of all three approaches.

However, by the use of moments one can achieve complete invariance without
complicated and time-consuming numerical computation.

With that in mind, we can now deal with some simple feature variables.


Area and Perimeter are commonly used descriptors for regions in a plane.


Euler Number  This number is a topological descriptor for binary images. It is
defined to be the number of connected components minus the number of holes inside
the connected components.


Position, Orientation, Symmetry  These are useful descriptors of objects within
images. Position refers to the location of the object in the plane. The object's
centroid (center of mass) is used to specify its position. Orientation refers to
how an object is situated in the plane. The object's moment of inertia is used to
determine its angle of orientation. Symmetry is the ratio of the minimum moment of
inertia to the maximum moment of inertia. For binary images the feature symmetry is
a rough measure of how elongated an object is. In this sense, a circle has a
symmetry of 1 and a straight line a symmetry of 0.


Moment invariants are image statistics that are independent of rotation,
translation (shift) and scale. Moment invariants are uniquely determined by an
image and vice versa. These properties facilitate pattern recognition to a great
extent.

The moment invariants can be derived from the definitions of moments, centralized
moments, and normalized centralized moments. Let $f$ be a continuous function
defined over $\mathbb{R}^2$. The moment of order $(p,q)$ of $f$ is defined by

$$m_{pq} = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} x^p y^q f(x,y)\,dx\,dy$$

where $p, q \in \{0, 1, 2, \dots\}$. It has been shown that if $f$ is a piecewise
continuous function with bounded support, then moments of all orders exist, and,
additionally, $m_{pq}$ is uniquely determined by $f$ and vice versa. The central
moments of $f$ are defined by

$$\mu_{pq} = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} (x - \bar{x})^p (y - \bar{y})^q f(x,y)\,dx\,dy$$

with

$$\bar{x} = \frac{m_{10}}{m_{00}} \quad\text{and}\quad \bar{y} = \frac{m_{01}}{m_{00}}$$

The point $(\bar{x}, \bar{y})$ is called the image centroid, which is the same as
the center of gravity of a rigid body in a force field.

The discrete counterpart of the centralized moment of order $(p,q)$ is given by

$$\mu_{pq} = \sum_{x=-\infty}^{\infty}\sum_{y=-\infty}^{\infty} (x - \bar{x})^p (y - \bar{y})^q f(x,y)$$

The normalized central moment $\eta_{pq}$ is defined by

$$\eta_{pq} = \frac{\mu_{pq}}{\mu_{00}^{\gamma}}$$

where

$$\gamma = \frac{p+q}{2} + 1$$

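The following seven moment invariants developed by Hu are, in the continuous case,
independent of rotation, shift and scale. In the discrete case some aberrations may
occur, so one has to be careful.

$$
\begin{aligned}
\phi_1 &= \eta_{20} + \eta_{02} \\
\phi_2 &= (\eta_{20} - \eta_{02})^2 + 4\eta_{11}^2 \\
\phi_3 &= (\eta_{30} - 3\eta_{12})^2 + (3\eta_{21} - \eta_{03})^2 \\
\phi_4 &= (\eta_{30} + \eta_{12})^2 + (\eta_{21} + \eta_{03})^2 \\
\phi_5 &= (\eta_{30} - 3\eta_{12})(\eta_{30} + \eta_{12})
          \left[(\eta_{30} + \eta_{12})^2 - 3(\eta_{21} + \eta_{03})^2\right] \\
       &\quad + (3\eta_{21} - \eta_{03})(\eta_{21} + \eta_{03})
          \left[3(\eta_{30} + \eta_{12})^2 - (\eta_{21} + \eta_{03})^2\right] \\
\phi_6 &= (\eta_{20} - \eta_{02})
          \left[(\eta_{30} + \eta_{12})^2 - (\eta_{21} + \eta_{03})^2\right]
          + 4\eta_{11}(\eta_{30} + \eta_{12})(\eta_{21} + \eta_{03}) \\
\phi_7 &= (3\eta_{21} - \eta_{03})(\eta_{30} + \eta_{12})
          \left[(\eta_{30} + \eta_{12})^2 - 3(\eta_{21} + \eta_{03})^2\right] \\
       &\quad + (3\eta_{12} - \eta_{30})(\eta_{21} + \eta_{03})
          \left[3(\eta_{30} + \eta_{12})^2 - (\eta_{21} + \eta_{03})^2\right]
\end{aligned}
$$

Very often so-called form factors are also used to extract information from images.
A very popular one is compactness:

$$\text{compactness} = \frac{\text{Perimeter}^2}{4\pi \times \text{Area}}$$

It is also possible to use the Hough transform for the detection of lines, and
statistical estimators of texture, like energy, entropy, texture correlation,
inverse difference moments, and inertia. This is, however, beyond our scope.

2.2 ... from Sensor Data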


If a sensor is conceived as a device which transforms a certain physical, chemical,
or biological property into a time-dependent electrical signal, then we have to
deal with one-dimensional problems. Because of their many advantages, we again use
moments, but only one-dimensional ones. Since sensor data are always digitized, we
have discrete data only. Therefore we are now in the domain of statistics which
deals with measurements of a certain variable at a certain time. Table 1 displays
the so-called empirical statistical moments of a one-dimensional probability
distribution.

For all our classification examples, we will use these features.
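
A minimal Python sketch of such empirical moments follows. It assumes the common textbook definitions (variance with N - 1 in the denominator, skewness and kurtosis normalized by the standard deviation, kurtosis reduced by 3); the exact normalization behind Table 1 may differ, and the function name and sample values are illustrative only.

```python
import math

def empirical_moments(x):
    """Return mean, average deviation, standard deviation, variance,
    skewness and kurtosis of a sequence of at least two samples."""
    n = len(x)
    ave = sum(x) / n
    adev = sum(abs(v - ave) for v in x) / n           # average deviation
    var = sum((v - ave) ** 2 for v in x) / (n - 1)    # assumed: N - 1
    sdev = math.sqrt(var)
    skew = sum(((v - ave) / sdev) ** 3 for v in x) / n
    kurt = sum(((v - ave) / sdev) ** 4 for v in x) / n - 3.0  # assumed: excess
    return {"Ave": ave, "ADev": adev, "SDev": sdev,
            "Var": var, "Skew": skew, "Kurt": kurt}

# Illustrative signal samples, not measured data.
features = empirical_moments([1.0, 2.0, 3.0, 4.0, 5.0])
for name, value in features.items():
    print(f"{name}: {value:.4f}")
```

Each digitized sensor curve is thus reduced to a handful of numbers, which then serve as the feature vector of that observation.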

Furthermore, Table 2 shows actual data from degradation measurements on
high-performance optics. There are four damage effects due to environmental
problems which degrade image quality. This is the test set for all pattern
recognition approaches discussed in this paper.

Figure 1 shows the backscatter curves for the four different error patterns. They
are significantly different from each other. There are 24 observations, one per
sample, namely 6 per error pattern. Five observations of every group form the
recognition set R, which is used for the supervised learning of the classification
algorithm employed. Observations 1, 7, 13 and 19 form the prediction set P and are
used for testing the predictive power of the classification approaches under study.
The occurrence of the 4 error patterns is in general equally likely.

3 Assessment of Features by Principal Component Analysis

The design matrix of a Principal Component Analysis (PCA) can easily be defined by
considering Table 3. For example, the h = 1, ..., n objects or sample members are
subjects, compounds, plants, or technical products. The k = 1, ..., p variables are
time-dependent biological responses, linear free energy-related (LFER) parameters,
quantum chemical indices, substituent constants, or descriptors of the quality of a
product. Formally, the design is equivalent to that of a one-group design with a
single set of variables.
No.  Name                      Formula                                                                 Remarks
1    $Q(x_1 \dots x_N)$        $\left(\frac{x_{\min}}{x_{\max}}\right)$                                $x_i$ is the measured backscatter intensity at the sample point $i$
2    $\bar{x}(x_1 \dots x_N)$  $\frac{1}{N}\sum_{i=1}^{N} x_i$                                         Average of all $N$ measured values
3    $ADev(x_1 \dots x_N)$     $\frac{1}{N}\sum_{i=1}^{N} |x_i - \bar{x}|$                             Average deviation; more robust estimator than variance
4    $\sigma(x_1 \dots x_N)$   $\sqrt{Var(x_1 \dots x_N)}$                                             Standard deviation; square root of variance
5    $Var(x_1 \dots x_N)$      $\frac{1}{N-1}\sum_{i=1}^{N} (x_i - \bar{x})^2$                         Variance; measure for the "variability" of data
6    $Skew(x_1 \dots x_N)$     $\frac{1}{N}\sum_{i=1}^{N} \left(\frac{x_i - \bar{x}}{\sigma}\right)^3$ Skewness; characterizes the shape of the underlying distribution
7    $Kurt(x_1 \dots x_N)$     $\frac{1}{N}\sum_{i=1}^{N} \left(\frac{x_i - \bar{x}}{\sigma}\right)^4 - 3$  Kurtosis; measure for "peakedness" or "flatness" of a distribution

Table 1: Feature variables extracted from backscatter measurement


3.1  Assumptions 

Design Robustness.

Please take care that for most designs the degree of freedom due to error,

$$n_e = n - p - 1$$

is equal to or larger than the degree of freedom due to hypothesis,

$$n_h = p$$
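
As a worked check of this condition, assume purely for illustration that the n = 20 recognition observations and the p = 7 feature variables of Table 2 form the design:

$$n_e = n - p - 1 = 20 - 7 - 1 = 12 \;\ge\; n_h = p = 7$$

Under this assumption the condition holds; with the same seven variables it would fail for fewer than n = 15 objects, since n - 8 must be at least 7.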

The variables are continuously distributed or discrete random variables. Whenever
possible, do not employ a mixture of measured variables and qualitative ones,
because the resulting categorization may induce a grouping effect. For example, if
a variable includes only ones and zeroes, two classes of sizes $N_1$ (the number of
ones) and $N_2$ (the number of zeroes) may lead to artificial clustering, mixtures
of distributions, etc. A careful check of the results is then necessary to avoid
wrong conclusions.

Model Robustness.

The analysis is scale sensitive. Outliers are expected to have disproportionately
high weight in influencing the orientation of objects in a multidimensional pattern
space. This is an advantage because unusual effects can thus be discovered very
rapidly.

Hypothesis Testing.

The method is perfectly general; it involves no assumptions on the underlying
distribution model. In this respect, it is a nonparametric approach. However, for
hypothesis testing, it is only optimal if multivariate normality is assumed. In
this respect, the probabilistic principal component analysis is parametric.
Effect         Group  Obs.  Set       Q      Ave   ADev   SDev   SVar   Skew     Kurt
Seating          1      1    P    -20.32  -13.20   1.47   2.08   4.34  -0.26     4.87
Seating          1      2    R    -17.43  -10.83   0.79   1.37   1.87   3.39    26.88
Seating          1      3    R    -16.85   -9.86   1.04   1.61   2.60   2.77    13.81
Seating          1      4    R    -16.42   -9.35   0.86   1.35   1.83   3.49    22.95
Seating          1      5    R    -16.85   -9.91   0.90   1.41   2.00   2.85    20.16
Seating          1      6    R    -16.94  -10.29   0.82   1.37   1.86   3.53    25.95
Fog              2      7    P    -14.60   -6.17   0.63   1.07   1.14   4.90    44.00
Fog              2      8    R    -14.66   -6.29   0.59   1.06   1.12   4.59    44.82
Fog              2      9    R    -14.60   -6.12   0.62   1.09   1.18   4.70    40.89
Fog              2     10    R    -14.49   -5.73   0.54   0.97   0.94   4.99    50.98
Fog              2     11    R    -14.77   -6.19   0.61   1.05   1.10   3.79    39.56
Fog              2     12    R    -14.70   -6.02   0.61   1.06   1.12   3.84    37.33
Holes            3     13    P    -14.44   -6.30   1.25   1.57   2.46   1.73     7.98
Holes            3     14    R    -14.68   -6.66   1.31   1.62   2.61   1.64     7.11
Holes            3     15    R    -14.31   -6.23   1.26   1.57   2.45   1.76     8.15
Holes            3     16    R    -14.22   -5.68   1.63   1.92   3.70   0.94     1.60
Holes            3     17    R    -14.26   -6.38   1.32   1.62   2.63   1.63     6.55
Holes            3     18    R    -14.32   -6.44   1.29   1.61   2.58   1.71     7.04
Seating + Fog    4     19    P    -14.50   -7.73   0.47   0.93   0.87   8.66   109.28
Seating + Fog    4     20    R    -15.63   -9.03   0.67   1.12   1.24   5.28    54.42
Seating + Fog    4     21    R    -14.70   -8.05   0.46   0.93   0.87   8.93   112.31
Seating + Fog    4     22    R    -15.25   -8.66   0.53   1.04   1.08   7.17    80.50
Seating + Fog    4     23    R    -15.96   -9.58   0.57   1.09   1.18   6.64    72.31
Seating + Fog    4     24    R    -15.57   -8.78   0.67   1.19   1.43   4.97    44.10

Table 2: Classification data (P: prediction set, R: recognition set)


3.2  Goal  of  Analysis 

1.  The original data should be transformed to asymptotically normally distributed
scores with unit variances and zero means in order to make variables with different
variances and/or meaning commensurable, regardless of the fact that the original
measurements may be "distorted" when all variables are "treated equally".
Furthermore, the resulting new variables should be independently distributed
(completely uncorrelated, orthogonal). This allows a better interpretation by
illustrating the results graphically, and has strong advantages if subsequent
analyses are to be made.

2.  The number of original variables should be reduced to satisfy the principle of
parsimony (to increase the test power of subsequent analyses) without appreciable
loss of information. Thus, the new variables should be arranged in order of
decreasing contribution to the information content of the variables to get a
hierarchy ("dominant is first, least is last").

3.  Internal relationships within the original variables, and between the original
and new variables, should easily be discovered.

Figure 1: Typical backscatter curves of the four kinds of surface damage


Objects   Y_1    Y_2    ...   Y_p
1         y_11   y_12   ...   y_1p
2         y_21   y_22   ...   y_2p
...       ...    ...    ...   ...
n         y_n1   y_n2   ...   y_np

Table 3: Design Matrix of PCA


3.3 Application Examples

Application examples are so manifold that only some can be collected here.

1.  Discovery of an unknown, common, synthetic factor (also called metameter,
composite parameter, overall parameter) of a profile of variables. Usually, the new
parameter may be regarded as a linear combination of the original variables. In
such a case, it is the first (dominant) principal component function. It should be
emphasized that linearity concerns only the mechanics.

For example, the variables $y$ and $\psi^2$ enter in linear combination if the
transformation $Y_1 = y$, $Y_2 = \psi^2$ is applied, yielding

$$\mathcal{F}(Y_1, Y_2) = a_0 + a_1 Y_1 + a_2 Y_2$$

where $a_0$ to $a_2$ are coefficients to be estimated.

2.  Discovery of the structure of a single variable which may be thought of as a
mixture of intercorrelated variables.

3.  Application of diagnostic statistics, such as the detection of (i) outlying
observations, (ii) multicollinear data structures, (iii) homogeneous subsets of
objects (clusters, groups) that are scattered around a central value, (iv)
homogeneous subsets of variables.

4.  Use of the components in subsequent analyses, such as in (i) regression
analysis and (ii) Euclidean distance analysis (in this case, the distances are
invariant under rotation of the axes).

3.4 Algorithms

3.4.1  Underlying  Formalisms 

1.  Let  Y  be  the  (n,p)-matrix  of  the  design  of 
Table  3, 

Y  =  {Vh)  =  {Vhk) 

where  h  =  1,...  ,n  denotes  the  objects, 
k  ~  1, . . .  denotes  the  number  of  vari¬ 
ables  that  determine  the  dimensionality  of  a 
distribution.  The  matrix  may  also  be  written 
in  terms  of  the  Yk  variables, 

where  denotes  the  transposed  matrix, 
and  the  parentheses  symbolize  an  order  of 
variables.  In  contrast,  {. . .  }  denotes  an  arbi¬ 
trary  sequence  of  parameters.  The  so-called 
within-group  matrix  becomes  then  in  this 
one-sample  design 

W  =  f:(y,-y)(y,_y)r 

/l=l 


where  the  p-dimensional  vector  of  arith¬ 
metic  mean  values  is  defined  by 

y  =  -Yyh 

^  h=l 

The  (p,p) -dimensional  variance-covariance 
or  dispersion  matrix  becomes  then 

s  =  -l-W 

n  -  1 

where  the  main-diagonal  elements  contain 
the  variances,  and  the  off-elements  include 
the  covariances.  This  matrix  plays  a  funda¬ 
mental  role  in  all  approaches.  Its  diagonal 
matrix  may  be  written  as 

diag{S)  =  D 

(it  includes  the  variances  in  the  main  diago¬ 
nal  while  the  off-elements  are  zeroes).  The 
correlation  matrix  R  is  the  second  matrix  of 
fundamental  importance;  it  is  defined  by 

R=  (D-5S)(D-5) 

In  this  special  case,  R  is  a  symmetric  (p,p)- 
matrix  which  simplifies  considerably  the 
computation  of  its  eigenvalues  and  eigenvec¬ 
tors. 

As  already  described,  the  /  =  1, . . .  ,  5  <  p 
non-zero  eigenvalues  A/  are  ordered,  and  its 
resulting  5-dimensional  vector  may  be  writ¬ 
ten  as 

A^  =  (AiA2...A,) 

Let  I  be  a  (p,p)-dimensional  identity  matrix 
(a  matrix  where  the  main  diagonal  contains 
ones  while  the  off-elements  include  zeroes). 
Then,  the  eigenvalues  are  obtained  by 

det(R  —  AI)v/  =  0 

where  det  denotes  the  determinant.  We  re¬ 
member  that  the  determinant  of  R  is  given 
by 

p 

det(R)  =  n 
/=i 

and  the  determinant  of  the  inverse  R~^  is 

det(R-i)  =  n 
/=! 
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These  properties  are  useful  for  procedure^ 
applied  to  other  techniques  than  PCA.  The 
eigenvectors  v j  are  obtained  by 

(R- A/I)  =  0 

where  the  normalization  condition  is 


(v/)^v/  =  1 


because  R  is  a  symmetric  matrix.  Remem¬ 
ber  also  that  the  angle  between  two  eigenvec¬ 
tors,  say,  for  example,  and  can  easily 
be  determined  by 


COs(Vq,V/3)  = 


I  Vail  llv^ll 


For  example,  let  be  the  first  eigenvector 
of  a  principal  component  analysis  of  group 
A,  and  let  v/j  be  the  first  eigenvector  of 
group  B,  This  offers  the  possibility  to  com¬ 
pare  groups  by  angular  statistics,  but  that  is 
not  a  matter  of  concern  here. 


If  only  the  ’’relevant”  eigenvalues  and  eigen¬ 
vectors  are  used,  the  resulting  correlation 
matrix  Rdeg  deviates  slightly  from  R,  and 
the  difference 


Re  —  ^  ^deg 

is  the  residual  matrix  which  should  only  con¬ 
tain  zero  or  near-zero  values. 

3.  In  a  third  stage  of  numerical  computation, 
the  correlation  matrix  M  between  original 
variables  and  components  may  be  estimated 
directly  or  by  the  simple  equation 

=  L5  V 

Re-arranging  the  equations,  we  can  now  re¬ 
turn  to  the  original  data  matrix  if  all  com¬ 
ponents  are  included,  or  to  a  degenerated  or 
adjusted  original  matrix  if  at  least  one  com¬ 
ponent  is  omitted. 


2.  The second step is the calculation of the matrix of coefficients of principal component functions,

$$\mathbf{P} = \mathbf{L}^{-1/2}\mathbf{V}$$

where L is the diagonal matrix of the eigenvalues, and V is the matrix of eigenvectors. As already mentioned, there are good reasons to make the original measurements commensurable. This may be done by the transformation

$$\mathbf{y}_h = \mathbf{D}^{-1/2}(\dot{\mathbf{y}}_h - \bar{\mathbf{y}})$$

The resulting matrix Y has unit variances and zero means. The scores of the principal component function are then computed by

$$\mathbf{F} = \mathbf{Y}\mathbf{P}$$

and the covariances (or correlations) between the components $F_f$ are zero. In other words, the dispersion matrix of the components is an identity matrix.

Spectral decomposition of the correlation matrix means that R can be expressed by

$$\mathbf{R} = \sum_{f=1}^{p} \lambda_f\, \mathbf{v}_f \mathbf{v}_f^{T} = \mathbf{V}\mathbf{L}\mathbf{V}^{T}$$


$$\hat{y}_{hk} = \bar{y}_k + s_k\, q_{hk} \qquad (k = 1, \dots, p)$$

where $\bar{y}_k$ and $s_k$ are the kth mean value and standard deviation (square root of the kth variance) of the original measurements, and

$$\mathbf{Q} = (q_{hk}) = \mathbf{F}\mathbf{M}^{T}$$

The adjusted variables may loosely be compared with the adjusted means of covariance analysis.

4.  The squared distance between two objects, termed A and B for simplicity, is the squared Euclidean distance

$$d^2(A, B) = \|\mathbf{F}_A - \mathbf{F}_B\|^2$$

Thus, principal component analysis may also be applied for discovering subsets of objects around the gravity center or other centers, and has strong relationships to cluster analysis and multidimensional scaling (principal coordinate analysis). It should be emphasized that the number of components applied in an actual analysis can be determined by the user.
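
The numerical steps listed above can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not the implementation used by the authors: the eigenvectors are stored as the columns of V, so the coefficient matrix is obtained by scaling each eigenvector column by the inverse square root of its eigenvalue, and the small design matrix of the test example in Section 3.5.1 is used as input. All variable names are illustrative.

```python
import numpy as np

# Design matrix of the test example in Section 3.5.1 (rows: objects, columns: variables).
base = np.array([1.0, 2.0, 3.0])
Y_raw = np.vstack([base, base**2, np.sqrt(base), np.log(base)])

# Standardize to zero means and unit variances (n - 1 in the denominator).
Y = (Y_raw - Y_raw.mean(axis=0)) / Y_raw.std(axis=0, ddof=1)
R = np.corrcoef(Y_raw, rowvar=False)

# Eigen-decomposition of the symmetric matrix R, sorted by decreasing eigenvalue.
lam, V = np.linalg.eigh(R)
order = np.argsort(lam)[::-1]
lam, V = lam[order], V[:, order]      # columns of V are unit-length eigenvectors

P = V / np.sqrt(lam)                  # coefficients of the component functions (assumed column convention)
F = Y @ P                             # component scores
M = V * np.sqrt(lam)                  # correlations between variables and components

# "Degenerated" correlation matrix from the two largest components, and its residual.
R_deg = (V[:, :2] * lam[:2]) @ V[:, :2].T
R_e = R - R_deg

# Squared Euclidean distance between objects A (row 0) and B (row 1) in score space.
d2_AB = float(np.sum((F[0] - F[1]) ** 2))

print("eigenvalues:", np.round(lam, 4))
print("det(R) and product of eigenvalues:", np.linalg.det(R), np.prod(lam))
print("largest absolute residual:", np.abs(R_e).max())
print("d^2(A, B):", round(d2_AB, 3))
```

With all components retained, the product of the scores and the transposed loading matrix returns the standardized data, and the dispersion matrix of the scores is the identity matrix, as stated in the text.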
3.4.2  Decision Procedure

Test of Dimensionality of Original Variables
The squared multiple correlation coefficients within the variables, called internal determination coefficients,

$$D_k = 1 - \Lambda_k = 1 - \frac{1}{r^{kk}}$$

must be significantly different from zero if subsequent analysis is to make sense. Note that $r^{kk}$ is the kth element of the main diagonal of $\mathbf{R}^{-1}$, and $\Lambda_k$ is the likelihood-ratio criterion.

It can be shown that, if at least one eigenvalue $\lambda_f$ of R is zero, at least one $D_k$ is one. The null hypothesis $H_{10}$ (at least one internal determination coefficient of the population is zero) is rejected at the significance level $\alpha$ if the test statistic TS is

$$TS = D_k > (1 - \Lambda_\alpha) = \theta_\alpha$$

where $\Lambda_\alpha$ is the likelihood-ratio criterion, $\theta_\alpha$ is the largest-root criterion, and the degrees of freedom are here

$$n_h = p - 1, \quad n_e = n - p$$

Rejecting  this  null  hypothesis  suggests  that  at  least 
one  multiple  correlation  coefficient  of  the  original 
data  is  significant. 
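
As a minimal sketch of the quantity tested here, the internal determination coefficients can be read off the diagonal of the inverse correlation matrix. The function name is illustrative, and the matrix used is the rounded correlation matrix printed for the test example in Section 3.5; the comparison with tabulated critical values is not included.

```python
import numpy as np

def internal_determination(R):
    """Return D_k = 1 - 1 / r^{kk}, where r^{kk} is the k-th diagonal element of R^{-1}."""
    R_inv = np.linalg.inv(np.asarray(R, dtype=float))
    return 1.0 - 1.0 / np.diag(R_inv)

# Correlation matrix of the test example, rounded to two decimals as printed.
R = np.array([[1.00, 0.63, 0.48],
              [0.63, 1.00, 0.98],
              [0.48, 0.98, 1.00]])

D = internal_determination(R)
print(np.round(D, 3))
```

For two variables the coefficient reduces to the squared correlation coefficient, which gives a quick plausibility check of the formula.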

Test  of  Dimensionality  of  Components  The  big 
question  of  PCA  is  how  the  essential  dimension¬ 
ality  should  be  determined.  We  have  solved  this 
problem  by  testing  the  eigenvalues  of  R  and  the 
correlation  coefficients  of  M. 

a.  Test of Eigenvalues. Let $\lambda_f$ be the estimator of the eigenvalue $\sigma_f$ of the population. Then, the global null hypothesis becomes

$$H_{01}: \; \sigma_1 = \sigma_2 = \dots = \sigma_s$$

This is equivalent to the assumption that the correlation matrix of the population is an identity matrix. "Accepting" this null hypothesis would suggest the existence of a spherical symmetry; rejecting it would suggest that at least one correlation coefficient of the original data is significant.


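The null hypothesis is rejected at the significance level $\alpha$ if

$$TS = q \ln\left(\alpha_0 / \beta_0\right)^{r} > \chi^2_{f_1;\alpha}$$

where $(\alpha_0/\beta_0)$ is the likelihood-ratio criterion. The test statistic is asymptotically $\chi^2$-distributed with

$$f_1 = \tfrac{1}{2}(p - K + 2)(p - K - 1)$$

degrees of freedom, and K is the number of eigenvalues to be removed (K = 0 for testing $H_{01}$). Furthermore, we obtain the arithmetic mean of the remaining eigenvalues,

$$\alpha_0 = \frac{1}{p - K} \sum_{f=K+1}^{p} \lambda_f$$

and their geometric mean,

$$\beta_0 = \left( \prod_{f=K+1}^{p} \lambda_f \right)^{1/(p-K)}$$

Thus, for testing $H_{01}$, the procedure is simplified to $\alpha_0 = 1$ and

$$\beta_0 = \det(\mathbf{R})^{1/p}$$

We must still define q and r. For small sample sizes, the two arguments are defined as follows:

$$r = 1, \quad q = (n - 1)(p - K)$$

For larger samples, a more conservative decision may be applied:

$$r = p - K, \quad q = (n - 1) - \tfrac{1}{6}(2p + 5) - \tfrac{2}{3}K$$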

If $H_{01}$ is rejected, we can test the significance of the individual eigenvalues, i.e. the local null hypotheses

$$H_{02}: \; \sigma_1 = 0,\ \sigma_2 = 0,\ \dots,\ \sigma_s = 0$$

First, $\lambda_1$ is excluded (K = 1), and TS and $f_1$ must be determined as described above. The desired test statistic for examining $H_{02} := (\sigma_1 = 0)$ is then given by the difference between the TS for K = 0 and the TS for K = 1. In strict analogy, the difference of the degrees of freedom is determined. Then, the significance point is taken from tables, and the procedure is continued if the first eigenvalue is significant. The subsequent eigenvalues are examined in the same way.

Remark. Remember that the test of significance of eigenvalues is sensitive to departures from normality.

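The information content of an individual component can be visualized by

$$\mathrm{inf}(\%)_{PCA} = \frac{\lambda_f \cdot 100\%}{\mathrm{tr}(\mathbf{R})}$$

Because R is a correlation matrix, its trace equals the sum of the eigenvalues, which is the number p of variables (and coincides with the number s of non-zero eigenvalues when no eigenvalue vanishes),

$$\mathrm{tr}(\mathbf{R}) = \sum_{f=1}^{p} \lambda_f = p$$

and the cumulative percentage $\mathrm{cum}(\%)_{PCA}$ is its successive sum.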
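
The eigenvalue test and the information content can be illustrated with the eigenvalues printed for the test example in Section 3.5 (n = 4 objects, p = 3 variables). The sketch below assumes that the exponent r multiplies the logarithm and that $\alpha_0$ and $\beta_0$ are the arithmetic and geometric means of the eigenvalues remaining after the K largest are removed; the function name is illustrative, and the critical value still has to be taken from a $\chi^2$ table.

```python
import math

def eigenvalue_test(lam, n, K=0, small_sample=True):
    """Return (TS, f1) for testing equality of the eigenvalues left after removing the K largest."""
    p = len(lam)
    rest = sorted(lam, reverse=True)[K:]
    alpha0 = sum(rest) / (p - K)                                # arithmetic mean
    beta0 = math.exp(sum(math.log(x) for x in rest) / (p - K))  # geometric mean
    if small_sample:
        r, q = 1, (n - 1) * (p - K)
    else:
        r, q = p - K, (n - 1) - (2 * p + 5) / 6 - 2 * K / 3
    ts = q * r * math.log(alpha0 / beta0)
    f1 = (p - K + 2) * (p - K - 1) / 2
    return ts, f1

lam = [2.42, 0.58, 0.96e-03]          # eigenvalues as printed in the listing of the test example
ts, f1 = eigenvalue_test(lam, n=4)    # global null hypothesis, K = 0

# Information content of each component in percent, and its cumulative sum.
info = [100.0 * x / sum(lam) for x in lam]
cum = [sum(info[: i + 1]) for i in range(len(info))]

print("TS =", round(ts, 2), "with", f1, "degrees of freedom")
print("inf(%):", [round(v, 2) for v in info])
print("cum(%):", [round(v, 2) for v in cum])
```

With the small-sample setting the statistic agrees with the value of about 19.83 reported in the listing, which is compared there with the tabulated $\chi^2$ value of 15.09 for 5 degrees of freedom at the 1% level.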

b.  Test of Correlation Coefficients. Up to now, a joint probability density function of sample correlation coefficients is not available in closed form. However, a-priori statistics can be employed.

Let $\rho$ denote any of the coefficients of a correlation matrix, and let r be its estimator. Then, the null hypothesis

$$H_{03}: \; \rho = 0$$

is rejected at the significance level $\alpha$ if the test statistic is

$$TS = |r| > \rho_0 = \left( \frac{f_1 F_{f_1,f_2;\alpha}}{f_1 F_{f_1,f_2;\alpha} + f_2} \right)^{1/2}$$

where the degrees of freedom are

$$f_1 = p, \quad f_2 = n - p - 1$$

with p as the number of variables under study.
c.  Diagnostic  Statistics 

1.  Normality. In general, parametric methods are relatively robust against departures from normality if the principles of design robustness are approximately satisfied. Nevertheless, in some cases it is necessary to examine this assumption statistically. A rejection of the hypothesis of normality means:


(i)  there  is  a  non-normality  or 

(ii)  we  have  a  mixed  distribution  con¬ 
sisting  of  unimodal  distributions 
with  different  variances  resp.  dis¬ 
persion  matrices  (grouped  obser¬ 
vations  within  a  one-sample  de¬ 
sign),  or 

(iii)  points  (i)  and  (ii)  are  valid.  In 
particular,  we  find  this  property  if 
one-group  designs  contain  various 
subgroups  (clusters  with  homoge¬ 
neous  distributions). 

For a small sample size (n < 30), the scores are ranked and then regressed against the ordinate values of the cumulative normal distribution (95% confidence limit). If the correlation coefficient is significant, the intercept is approximately 50%, and a common slope can be observed, normality may be hypothesized.

For larger sample sizes, a frequency histogram shows a bell-shaped distribution, that is, plotting the cumulative frequencies vs. the upper class limits leads to a straight line on the normal probability paper of Hazen. Rules are available for the assignment of sample members to classes and the estimation of class limits. Another possibility is the estimation of skewness $g_3$ and kurtosis $g_4$. For large samples (asymptotic case), the skewness becomes:


$$\text{skew.} = \frac{n \sum_{h=1}^{n} (y_h - \bar{y})^3}{(n - 1)(n - 2)\, s^3}$$

(missing if the standard deviation s is zero, or if the sample size n is less than 3). We use the standardized skewness,

$$g_3 = \frac{\text{skew.}}{\sqrt{6/n}}$$

because it is asymptotically normally distributed (for example, an absolute value of $g_3$ that is equal to or larger than 1.96 means that the hypothesis of normality must be rejected at the 5% significance level or less). For large samples, the kurtosis becomes

$$\text{kurt.} = \frac{n(n + 1) \sum_{h=1}^{n} (y_h - \bar{y})^4}{(n - 1)(n - 2)(n - 3)\, s^4} - \frac{3(n - 1)^2}{(n - 2)(n - 3)}$$

(missing if s = 0 or n < 4). We propose to use the standardized version,

$$g_4 = \frac{\text{kurt.}}{\sqrt{24/n}}$$

because this statistic is asymptotically normally distributed.

We also employ a suitable compromise between the two procedures. It is supposed that $y_{hk}$ represents h = 1, ..., n independent observations of a continuous random variable $Y_k$ for each of the k = 1, ..., p variables, with a cumulative distribution function (cdf)

$$\Psi\left( \frac{y - \mu_k}{\sigma_k} \right)$$

where $\mu_k$ and $\sigma_k$ are the population means and standard deviations. The probability plot is a scatter of the ordered sample values

$$y_{1k} \le \dots \le y_{nk} \quad \text{vs.} \quad z_{1k} \le \dots \le z_{nk}$$

where

$$z_{hk} = \Psi^{-1}(\pi_{hk})$$

with $\pi_{hk}$ as an empirical estimate of the cdf. Several values for $\pi_{hk}$ have been suggested; we use here


'^hk  — 


jh-l) 

+  i) 


(for each k = 1, ..., p)

If $\Psi$ is the cdf of a normal distribution, the resulting scatter plot is called a normal probability plot, and the $z_{hk}$ are called normal scores. Statistically speaking, normality must be hypothesized if the correlation coefficient between the ranked original data and the normal scores is significant and if there is a common straight line. It must be emphasized that a graphical plot is an important means of surveying the role of individual sample members (outliers, clustering etc.).

Deviations  from  normality  have  usually 
no  serious  consequences  if  design  and 
model  robustness  exists,  and  if  there  is 
a  symmetry  of  distribution.  In  some 
cases,  an  apparent  normality  may  con¬ 
sist  of  a  mixture  of  distributions. 

Non-normality can be compensated for by applying a transformation (which normalizes and often works as a variance-stabilizing transformation, too) to each variable. Although the effect on multivariate normality is generally unknown and this univariate transformation may only achieve marginal normality, it has the advantage of simplicity. The limitation of tests of multivariate normality is that their power is still to be shown; furthermore, tables of critical quantiles in general include only the uni- and bivariate case. Thus, it is difficult to test multivariate normality with the exception of very particular designs. We propose to examine a jointly distributed composite parameter, such as the scores of the significant principal component functions.
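
The skewness and kurtosis estimators given above can be sketched as follows. This is a minimal illustration: the function names are not taken from the Mdad package, and the standardization by the large-sample standard errors $\sqrt{6/n}$ and $\sqrt{24/n}$ is an assumption based on the usual asymptotic results.

```python
import math

def _mean_and_sd(y):
    n = len(y)
    mean = sum(y) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in y) / (n - 1))
    return mean, sd

def skewness(y):
    """Sample skewness; None ("missing") if n < 3 or the standard deviation is zero."""
    n = len(y)
    if n < 3:
        return None
    mean, sd = _mean_and_sd(y)
    if sd == 0:
        return None
    return n * sum((v - mean) ** 3 for v in y) / ((n - 1) * (n - 2) * sd ** 3)

def kurtosis(y):
    """Sample excess kurtosis; None ("missing") if n < 4 or the standard deviation is zero."""
    n = len(y)
    if n < 4:
        return None
    mean, sd = _mean_and_sd(y)
    if sd == 0:
        return None
    first = n * (n + 1) * sum((v - mean) ** 4 for v in y) / ((n - 1) * (n - 2) * (n - 3) * sd ** 4)
    return first - 3 * (n - 1) ** 2 / ((n - 2) * (n - 3))

def standardized(statistic, n, factor):
    """Divide by the assumed asymptotic standard error sqrt(factor / n): factor 6 for skewness, 24 for kurtosis."""
    return statistic / math.sqrt(factor / n)

y = [1.0, 2.0, 3.0, 4.0, 5.0]
g3 = standardized(skewness(y), len(y), 6)
g4 = standardized(kurtosis(y), len(y), 24)
print("skew. =", skewness(y), " kurt. =", round(kurtosis(y), 4))
print("g3 =", round(g3, 4), " g4 =", round(g4, 4))
```

An absolute value of g3 or g4 of 1.96 or more would speak against normality at the 5% level; for so few observations the asymptotic argument is of course only indicative.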

2.  Randomness  of  Data.  Randomness  of 
data  is  the  basis  of  considering  each  se¬ 
lection  of  objects  to  be  equally  likely. 
Various  tests  are  available.  We  employ 
the  serial  correlation  coefficient.  In 
most  cases,  randomness  is  obtained  by 
experimental  designing  so  that  mutual 
dependence  of  the  sample  members 
(autocorrelation)  can  be  excluded.  In 
some  cases,  a  measurement  or  residual 
of  regression  analysis  may  depend  on 
the  previous  values.  Examples:  Growth 
curves,  time  series,  nonlinear  models, 
step-by-step optimization procedures (the result then depends on the previous results). The estimators will no longer be minimum variance estimators. Compensating for non-randomness is based on the use of random number tables and of autoregressive transformations.
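
The text does not give the formula of the serial correlation coefficient. A common lag-1 form, assumed here purely for illustration, relates the sum of products of successive deviations from the mean to the sum of squared deviations:

```python
def serial_correlation(y, lag=1):
    """Lag-`lag` serial correlation coefficient of a sequence (one common definition, assumed here)."""
    n = len(y)
    if n <= lag:
        raise ValueError("sequence is too short for the requested lag")
    mean = sum(y) / n
    denominator = sum((v - mean) ** 2 for v in y)
    if denominator == 0:
        raise ValueError("sequence is constant")
    numerator = sum((y[t] - mean) * (y[t + lag] - mean) for t in range(n - lag))
    return numerator / denominator

trend = [1.0, 2.0, 3.0, 4.0, 5.0]                  # each value depends on its predecessor
alternating = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0]    # strong negative dependence
print(serial_correlation(trend), serial_correlation(alternating))
```

Values close to zero are compatible with randomness; a clearly positive or negative value indicates that a measurement depends on the previous ones.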


3.  Outliers. Let $\mathbf{R}_{-h}$ be the correlation matrix computed without the sample member h, and let R be the correlation matrix of all sample members. Then, the smallest scattering ratio

$$R_{-h} = \min_h \left\{ \frac{\det(\mathbf{R}_{-h})}{\det(\mathbf{R})} \right\}$$

is an indicator that h is an outlier (the range of $R_{-h}$ varies from 0 to 1). If the empirical value $R_{-h}$ is less than the tabulated quantile $r_\alpha$, the null hypothesis (no outlier) must be rejected. It should be emphasized that this test is very radical ("very sharp").
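
A leave-one-out computation of the scattering ratios might look as follows. The data are hypothetical (two nearly proportional variables with one deliberately displaced object), the function name is illustrative, and the comparison with the tabulated quantile is omitted.

```python
import numpy as np

def scattering_ratios(Y):
    """Return det(R_{-h}) / det(R) for every sample member h (rows of Y are objects)."""
    Y = np.asarray(Y, dtype=float)
    det_all = np.linalg.det(np.corrcoef(Y, rowvar=False))
    ratios = []
    for h in range(Y.shape[0]):
        Y_without_h = np.delete(Y, h, axis=0)
        ratios.append(np.linalg.det(np.corrcoef(Y_without_h, rowvar=False)) / det_all)
    return np.array(ratios)

# Hypothetical data: the last object breaks the otherwise almost linear relation.
x = np.arange(1.0, 9.0)
y = np.array([1.1, 1.9, 3.2, 3.8, 5.1, 6.0, 6.9, 2.0])
ratios = scattering_ratios(np.column_stack([x, y]))

suspect = int(np.argmin(ratios))
print("ratios:", np.round(ratios, 4))
print("candidate outlier: object", suspect, "with ratio", round(float(ratios[suspect]), 4))
```

The object whose removal shrinks the determinant most is the outlier candidate; whether it is rejected depends on the tabulated quantile for the given sample size.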


3.5  Computational  Examples 
3.5.1  Test  example 

The question now is how PCA can be used as a tool for machine learning and pattern recognition. The approach we employed is illustrated by the following small example. Imagine that we have four different patterns in our data set. These patterns can be thought of as obtained from the same input data Y, which have been subjected to certain data transformations T. For simplicity we choose:

•  $T_\alpha = Y$ (i.e. the output data are equal to the input data)

•  $T_\beta = Y^2$

•  $T_\gamma = \sqrt{Y}$

•  $T_\delta = \ln(Y)$

Using the set of input data Y = (1, 2, 3), the (4,3)-design matrix of PCA becomes:


$$\begin{pmatrix} 1.00 & 2.00 & 3.00 \\ 1.00 & 4.00 & 9.00 \\ 1.00 & 1.41 & 1.73 \\ 0.00 & 0.69 & 1.10 \end{pmatrix}$$

Figure 2 shows an extract of a listing produced by the software package Mdad: Multivariate Design, Analysis and Diagnosing, which was developed in the research group of the author. The signs of the 1st and 2nd principal component are:

$$\mathbf{C} = \begin{pmatrix} + & - \\ + & + \\ - & - \\ - & + \end{pmatrix} \quad \begin{pmatrix} T_\alpha \\ T_\beta \\ T_\gamma \\ T_\delta \end{pmatrix}$$

where C is the classification matrix. The four different patterns are fully recognized. The features are consistent in this simple case.


MDAD 1.02a (c) - Multivariate Statistics
Principal Component Analysis

Matrix TEST-EXPL.MAT
              1         2         3
    1      1.00      2.00      3.00
    2      1.00      4.00      9.00
    3      1.00      1.41      1.73
    4      0.00      0.69      1.10

Means
           0.75      2.03      3.71
Standard Deviations
           0.50      1.42      3.62

Correlation Matrix (c0 = 1.00)
              1         2         3
    1      1.00      0.63      0.48
    2      0.63      1.00      0.98
    3      0.48      0.98      1.00

Eigenvalues
           2.42 *    0.58      0.96e-03

Eigenvectors
              1         2         3
    1      0.48     -0.86      0.15
    2      0.63      0.23     -0.74
    3      0.60      0.45      0.66

Principal Component Function
              1         2         3
    1      0.31     -1.13      4.75
    2      0.41      0.30    -23.89
    3      0.39      0.59     21.24

Principal Component Function Scores
              1         2         3
    1      0.07     -0.69     -1.33
    2      1.29      0.72      0.26
    3     -0.23     -1.02      1.08
    4     -1.13      0.99     -6.82e-03

The global Null Hypothesis is rejected: TS (19.83) > ChiSquare (15.09)
General Alpha level is equal to 1% (Values marked by an asterisk are significant)

Test of Eigenvalues was performed automatically
Number of Non-Zero Eigenvalues is 3
Number of significant Eigenvalues is 1

Figure  2:  Listing  in  extracts 
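
The sign pattern used for classification in this test example can be recomputed with a short NumPy sketch. Eigenvectors are only determined up to sign, so the orientation chosen below (last component of each eigenvector positive) is an assumption made to match the orientation printed in the listing; the labels and variable names are illustrative.

```python
import numpy as np

base = np.array([1.0, 2.0, 3.0])
labels = ["T_alpha", "T_beta", "T_gamma", "T_delta"]
X = np.vstack([base, base**2, np.sqrt(base), np.log(base)])   # (4,3) design matrix

# Standardize the columns and decompose the correlation matrix.
Y = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
lam, V = np.linalg.eigh(np.corrcoef(X, rowvar=False))
order = np.argsort(lam)[::-1]
lam, V = lam[order], V[:, order]

# Assumed sign convention: make the last component of every eigenvector positive.
V = V * np.sign(V[-1, :])

F = Y @ (V / np.sqrt(lam))                                    # component scores

# Signs of the 1st and 2nd principal component for each pattern.
signs = ["".join("+" if value > 0 else "-" for value in row[:2]) for row in F]
for label, pattern in zip(labels, signs):
    print(label, pattern)
```

Each of the four transformations obtains its own pair of signs, so the sign pattern of the first two components is sufficient to tell the patterns apart in this small example.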


3.5.2  Example 

Figure 3 shows classification results for the example according to Table 2.

The statistical moments were computed according to Table 1, and the resulting design matrix was subjected to a principal component analysis. The signs of the principal components were used for classification; no error occurred. Furthermore, the results of PCA indicate that the features form a consistent design matrix from the statistical point of view. That means all pattern recognition approaches should be able to detect the four error patterns.

[Plot: sign of PC 1 and PC 2 (first and second principal component) vs. observation number]

Figure 3: 1st and 2nd principal component vs. No. of observation

4  Pattern Recognition Methods

4.1  Discriminant Analysis

4.1.1  General purpose and computational approach

General purpose  Discriminant analysis is commonly used to determine which variables discriminate best between two or more groups. The basic idea underlying discriminant analysis is to determine whether groups differ with regard to the mean of a feature variable. This variable is then used to predict group membership.

Computational approach  Computationally, discriminant analysis is very similar to the analysis of variance (ANOVA/MANOVA). In the case of a single variable, the final significance test of whether or not a variable discriminates between groups is the F-test. F is computed as the ratio of the between-groups variance in the data over the pooled within-group variance. If the between-group variance is significantly larger than the within-group variance, then there must be significant differences between means. Usually, several variables are included in a discriminant analysis; we actually use 7 feature variables in our example. Consequently, we have a matrix of total variances and covariances, and also a matrix of pooled within-group variances and covariances. These two matrices can be compared via multivariate F-tests in order to decide whether or not there are any significant differences between groups. Even though the computations with multiple variables are more complex, the principal reasoning is still the same, namely that we are looking for significant mean differences between groups.

Before starting the computational procedures it always makes sense to have a look at the data profiles. This gives a first impression of whether discrimination of multiple groups could be possible. In Figure 4 the 7 feature variables are displayed in the form of "data canyons".

It can be seen that the effects "Seating" and "Holes" have rather different profiles, while the separation of "Fog" and "Seating + Fog" could be difficult. The main difference which is clearly recognizable is the form of the "ground" of the "canyon". In the case of "Seating + Fog" the ground is shaped like a gable, while the "Fog" profiles show a straight line tilted from left to right. However, discrimination of all 4 groups should be possible.

Canonical  analysis  When  actually  performing  a 
multiple  group  discriminant  analysis  the  use  of 
canonical  discriminant  analysis  will  provide  some 
optimal  combinations  of  groups  and  variables  with 
respect  to  best  discrimination  between  groups.  In 
this  sense,  the  first  discriminant  function  provides 
the  most  overall  discrimination  between  groups, 
the  second  provides  second  most,  and  so  on.  More¬ 
over,  the  discriminant  functions  will  be  orthogo¬ 
nal,  i.e.  their  contributions  to  the  discrimination 
of  groups  will  not  overlap. 

4.1.2  Assumptions 

Normality  It is assumed that the measured data (backscatter curves) and the derived feature variables represent a sample from a multivariate normal distribution. However, violations of the normality assumption are usually not severe, meaning that the results of analysis are still reliable.

Homogeneity  It is assumed that the variance/covariance matrices are homogeneous across groups. The multivariate Box M-test can be used. If deviations from homogeneity are detected, this should not be taken too seriously. However, our data are homogeneous.

Correlations  between  means  and  variances 
Validity  of  discriminant  analysis  can  be  influenced 
when  means  and  variances  across  groups  are  cor¬ 
related.  If  there  is  a  large  variability  in  a  group 
with  particularly  high  means  on  some  variables, 
then  those  means  are  not  reliable.  Significance  tests 
will  fail  in  this  case.  The  experimental  data  of  our 


example  do  not  show  this  property. 

Categorized  normal  probability  plot  This  plot 
is  constructed  as  follows.  First,  the  deviations 
from  the  mean,  i.e.  the  residuals,  are  rank  or¬ 
dered.  From  these  ranks  standardized  values  of  the 
normal  distribution  based  on  the  assumption  that 
the  data  come  from  a  normal  distribution  are  com¬ 
puted.  These  values  are  plotted  on  the  y-axis.  If 
the  observed  residuals  which  are  plotted  on  the  x- 
axis  are  normally  distributed  then  they  will  not  de¬ 
viate  from  the  line.  If  there  is  a  general  lack  of  fit, 
and  the  data  do  not  form  a  clear  pattern,  then  non¬ 
normality  may  be  assumed. 
Figure 4: Data profiles


UCVT   Perc. of   Intercept                       Coefficients for
No.    Var.                        Q       Ave      ADev     SDev     SVar     Skew     Kurt
                                   1        2        3        4        5        6        7

1      60.79       -67.02        2.22    -2.43    45.61    72.55   -32.03    -0.48     0.20
2      30.96       -94.98       -6.28     6.73     1.63    71.59   -24.21    -1.75     0.18
3       8.25      -195.26      -11.69     6.57     7.62    75.24   -24.98     0.86     0.11

Table 4: Unstandardized canonical discriminant function coefficients
Group              Value of UCVT No.
No.            1            2            3

1           1.1860      -4.1838       4.1985
2         -10.4797       5.3353       0.0329
3          10.4154       5.1644      -0.8131
4          -1.1217      -6.3159      -3.4183

Table 5: Unstandardized canonical discriminant functions evaluated at group means


4.1.3  Canonical  discriminant  functions  for 
multiple  groups 

Classification is performed by using the feature variables and the 3 unstandardized canonical discriminant functions UCVT. The coefficients of these functions can be seen in Table 4. The values of the UCVT evaluated at group means (cf. Table 5) give the following pattern of signs:

$$\begin{pmatrix} + & - & + \\ - & + & + \\ + & + & - \\ - & - & - \end{pmatrix} \quad \begin{pmatrix} \text{Seating} \\ \text{Fog} \\ \text{Holes} \\ \text{Seating + Fog} \end{pmatrix}$$

Of course, this is only a "rule of thumb" for classification, because the proper cut-off points obtained from the multivariate procedure must be used. To illustrate the possibility of real-time implementation of the algorithm, the computation of the first discriminant function for the group mean of the first group, "Seating", is demonstrated now:

$$UCVT_{1,1} = a_{1,1,0} + \sum_{i=1}^{7} a_{1,1,i}\, x_i$$

where $a_{1,1,i}$ (i = 0, 1, ..., 7) are the coefficients of the $UCVT_{1,1}$ and $x_i$ (i = 1, 2, ..., 7) are the feature variables. Inserting the numerical values we obtain:

$$\begin{aligned} UCVT_{1,1} = {} & -67.0162 + 2.2168 \times (-16.8968) \\ & - 2.4335 \times (-10.0484) + 45.6078 \times (0.8822) \\ & + 72.5525 \times (1.4222) - 32.0300 \times (2.0326) \\ & - 0.4794 \times (3.2028) + 0.2017 \times (21.9470) \end{aligned}$$

The result of these 7 multiplications and 7 additions is 1.186. Consequently, classification itself is very simple and can be performed in real time even with moderate computational power.
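
The evaluation of a discriminant function is a single dot product plus an intercept, as the following sketch shows. The numbers are those inserted in the text above (coefficients of the first function and the group means of the group "Seating"); the function and variable names are illustrative.

```python
def discriminant_score(intercept, coefficients, features):
    """Evaluate one unstandardized canonical discriminant function for one feature vector."""
    if len(coefficients) != len(features):
        raise ValueError("one coefficient per feature variable is required")
    return intercept + sum(c * x for c, x in zip(coefficients, features))

# First discriminant function (Table 4, full precision as used in the text).
intercept = -67.0162
coefficients = [2.2168, -2.4335, 45.6078, 72.5525, -32.0300, -0.4794, 0.2017]

# Group means of the 7 feature variables for the first group ("Seating").
features = [-16.8968, -10.0484, 0.8822, 1.4222, 2.0326, 3.2028, 21.9470]

ucvt_11 = discriminant_score(intercept, coefficients, features)
print(round(ucvt_11, 3))
```

The result agrees with the entry for group 1 and function 1 in Table 5 to three decimals.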

Table 6 shows the results of prediction, i.e. classification of the observations 1, 7, 13 and 19. No error occurred; 100% accuracy was achieved.


Obs.               Value of UCVT No.               Pred.
No.            1            2            3         Grp.

1           0.4690      -8.4919      15.2796       1 ✓
7          -8.3681       4.2935       0.3135       2 ✓
13          9.1518       6.3696      -0.4366       3 ✓
19         -1.2544      -5.1226      -5.6608       4 ✓

Table 6: Unstandardized canonical discriminant functions evaluated at the prediction set
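
One simple way to turn the scores of Table 6 into group predictions is to assign each observation to the group whose mean (Table 5) is nearest in the space of the three discriminant functions. This nearest-mean rule is an assumption made for illustration; it is not necessarily the cut-off procedure used by the software, but it reproduces the predictions of Table 6.

```python
# Group means of the three discriminant functions (Table 5).
group_means = {
    "Seating":       (1.1860, -4.1838, 4.1985),
    "Fog":           (-10.4797, 5.3353, 0.0329),
    "Holes":         (10.4154, 5.1644, -0.8131),
    "Seating + Fog": (-1.1217, -6.3159, -3.4183),
}

# Discriminant scores of the four observations of the prediction set (Table 6).
observations = {
    1:  (0.4690, -8.4919, 15.2796),
    7:  (-8.3681, 4.2935, 0.3135),
    13: (9.1518, 6.3696, -0.4366),
    19: (-1.2544, -5.1226, -5.6608),
}

def nearest_group(scores):
    """Return the group whose mean has the smallest squared Euclidean distance to the scores."""
    def squared_distance(group):
        return sum((s - m) ** 2 for s, m in zip(scores, group_means[group]))
    return min(group_means, key=squared_distance)

predicted = {obs: nearest_group(scores) for obs, scores in observations.items()}
for obs, group in predicted.items():
    print("observation", obs, "->", group)
```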


4.1.4  Statistical  inference 

All results of statistical analyses have to be checked carefully for significance. Table 8 therefore shows an extract of a listing produced by the software package Mdad - Multivariate Design, Analysis and Diagnosing, which was developed in the research group of the authors. This listing clearly indicates the overall significance of the discriminant analysis, univariately as well as multivariately.

Table 7 gives an impression of the discriminatory power of the single feature variables for the overall performance. Table 9 shows the power of the single feature variables for the discrimination of the single groups.

A main point is that all feature variables are needed for the discrimination of the 4 error patterns, even if some of them are not significant for the discrimination of certain groups.


Values of mean SCVT-Coefficients for all feature variables

   Q        Ave      ADev     SDev     SVar     Skew     Kurt
   1         2         3        4        5        6        7

1.1959    1.7329    2.6932   7.1225   8.1873   0.6406   1.8474

Table 7: Discrimination power of the feature variables referring to all UCVT
High values denote a high discrimination power


4.2  Artificial  Neural  Networks 

4.2.1  Introduction 

Neural networks have arisen from analogies with models of the way that humans might solve the pattern recognition task. However, they have developed a long way from their biological roots. Great claims have been made for artificial neural networks, but few of them have withstood careful checks. Nevertheless, neural networks have influenced pattern recognition practice very strongly. Although there is at the moment no full theoretical understanding of how they work, dozens of software packages and hardware accelerator cards have emerged on the market and disseminate the ideas of pattern recognition via artificial neural networks.

The most widely used and easily recognized neural network is the so-called feedforward network. By some estimates, it accounts for 90 percent of all neural network applications.

Typical applications outside the area of pattern recognition are for example: predictions of the stock market, recognition of handwriting, synthesis of speech from text, driving a truck.

4.2.2  Feedforward Network Operation

Figure 5 shows a common three-layer feedforward architecture. By "three-layer" is meant that the network has an input layer, one hidden layer, and an output layer. More hidden layers can be used. In general, three layers are sufficient to enable this type of network to model any deterministic process to a certain degree; therefore we will use it here for the purpose of pattern recognition.

Figure 5 actually represents a function acting on a vector, where the neural network function sends the vector $(x_1, \dots, x_N)$ in $R^N$ to the vector $(y_1, \dots, y_M)$ in $R^M$. Thus, the feedforward network can be represented as:

$$\mathbf{y} = F(\mathbf{x})$$

where $\mathbf{x} = (x_1, \dots, x_N)$ and $\mathbf{y} = (y_1, \dots, y_M)$. The action of this function is determined in a specific way. For a network with N input nodes, H hidden layer nodes, and M output nodes, the values $y_k$ are given by:

$$y_k = g\left( \sum_{j=1}^{H} w^{O}_{jk}\, h_j \right)$$

with k = 1, ..., M.

Variable $w^{O}_{jk}$ is the output "weight" from hidden node j to output node k, and g is a function mapping $R^1$ to $R^1$. The values of the hidden layer nodes $h_j$, j = 1, ..., H, can be computed as follows:

$$h_j = \sigma\left( \sum_{i=1}^{N} w^{I}_{ij}\, x_i + w^{T}_{j} \right)$$

with j = 1, ..., H. Here, $w^{I}_{ij}$ is the input "weight" from input node i to hidden node j, $w^{T}_{j}$ is a threshold "weight" from an input node which has the constant value 1 to hidden node j, and $x_i$ is the value at


Multivariate Statistics - Discriminant Analysis

Number of variables: 7   Number of groups: 4   Number of cases: 20
Number of cases in groups: 5 5 5 5
Group prior probabilities: 0.25 0.25 0.25 0.25
Number of unknown observations to classify: 4
Degrees of freedom due to hypothesis/error (nh)/(ne): 3/16

Univariate Wilks' Lambda and F-Ratio
gives the discrimination capability of the variables via variance shares.
Alpha level:     0.01    (0.05)    [0.1]
F-Table value:   5.2922  (3.2389)  [2.4618]
The Chisquare statistics was transformed into a F-ratio.
A variable is significant if the F-ratio > F-table value(3,16,alpha)

Var.    Wilks      F-Ratio    Significance
No.     Lambda

1       0.07634    64.53      yes  (yes)  [yes]
2       0.05281    95.65      yes  (yes)  [yes]
3       0.07685    64.07      yes  (yes)  [yes]
4       0.11681    40.33      yes  (yes)  [yes]
5       0.14571    31.27      yes  (yes)  [yes]
6       0.15334    29.45      yes  (yes)  [yes]
7       0.19623    21.85      yes  (yes)  [yes]

Multivariate Test Criteria

If the Trace-criterion >= c * F-table value(3,16,0.01) it is significant
-> Trace Criterion: 113.5858 / Table value: 12.8234

Canonical Discriminant Functions

The unstandardized canonical discriminant functions (UCDF)
are evaluated from the non-zero eigenvalues. An UCDF is
significant if the Chisquare statistics > Chisquare-table value(DF/0.01).

After R.   Wilks      Degrees of   Chisquare    Table       Signi-
Fct        Lambda     Freedom      Value        Value       ficance

1          0.00004    21           137.37589    38.93217    yes
2          0.00267    12            80.01103    26.21696    yes
3          0.09646     5            31.57141    15.08627    yes

0 discriminant function(s) excluded from further analysis.
3 discriminant function(s) remained after the test of significance.

Table 8: Listing in extracts
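
The listing does not state how the univariate F-ratios were obtained from Wilks' Lambda. For a single variable the standard relation is F = ((1 - Lambda) / Lambda) (ne / nh); the sketch below assumes this relation and checks it against the values printed in Table 8 (nh = 3, ne = 16). Names are illustrative.

```python
def f_from_wilks(wilks_lambda, n_h, n_e):
    """Univariate F-ratio corresponding to Wilks' Lambda with n_h and n_e degrees of freedom."""
    return (1.0 - wilks_lambda) / wilks_lambda * n_e / n_h

# Wilks' Lambda and F-ratio per variable, as printed in Table 8.
wilks = [0.07634, 0.05281, 0.07685, 0.11681, 0.14571, 0.15334, 0.19623]
f_listed = [64.53, 95.65, 64.07, 40.33, 31.27, 29.45, 21.85]
f_table_1_percent = 5.2922

f_computed = [f_from_wilks(value, n_h=3, n_e=16) for value in wilks]
for number, (f_value, listed) in enumerate(zip(f_computed, f_listed), start=1):
    verdict = "yes" if f_value > f_table_1_percent else "no"
    print(number, round(f_value, 2), listed, verdict)
```

Within the rounding of the printed Lambda values the computed ratios agree with the listing, and all seven variables exceed the 1% table value.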
Group   Group   Var.   t-Statistics     Significance at α-Level
A       B       No.                     1%      5%      10%

1       2       1        10.0669        yes     yes     yes
1       2       2        12.6259        yes     yes     yes
1       2       3         4.0255        yes     yes     yes
1       2       4         5.0988        yes     yes     yes
1       2       5         4.1253        yes     yes     yes
1       2       6         1.9012        no      no      yes
1       2       7         2.1328        no      yes     yes
1       3       1        11.3433        yes     yes     yes
1       3       2        11.9620        yes     yes     yes
1       3       3         6.6115        yes     yes     yes
1       3       4         3.2958        yes     yes     yes
1       3       5         3.3411        yes     yes     yes
1       3       6         2.6903        no      yes     yes
1       3       7         1.6284        no      no      no
1       4       1         6.5868        yes     yes     yes
1       4       2         3.9061        yes     yes     yes
1       4       3         4.1968        yes     yes     yes
1       4       4         4.7020        yes     yes     yes
1       4       5         3.8293        yes     yes     yes
1       4       6         5.4751        yes     yes     yes
1       4       7         5.2143        yes     yes     yes
2       3       1         1.2764        no      no      no
2       3       2         0.6639        no      no      no
2       3       3        10.6370        yes     yes     yes
2       3       4         8.3946        yes     yes     yes
2       3       5         7.4664        yes     yes     yes
2       3       6         4.5914        yes     yes     yes
2       3       7         3.7612        yes     yes     yes
2       4       1         3.4801        yes     yes     yes
2       4       2         8.7198        yes     yes     yes
2       4       3         0.1713        no      no      no
2       4       4         0.3968        no      no      no
2       4       5         0.2959        no      no      no
2       4       6         3.5739        yes     yes     yes
2       4       7         3.0815        yes     yes     yes
3       4       1         4.7565        yes     yes     yes
3       4       2         8.0559        yes     yes     yes
3       4       3        10.8083        yes     yes     yes
3       4       4         7.9978        yes     yes     yes
3       4       5         7.1705        yes     yes     yes
3       4       6         8.1653        yes     yes     yes
3       4       7         6.8428        yes     yes     yes

Table 9: Univariate t-statistics estimate how two groups differ with respect to one variable. Alpha level / t-table value: 0.01, 0.05, 0.10 / 2.92, 2.12, 1.75 (The local null hypothesis is rejected if the t-statistic > t-table value.)


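The decision rule stated in the caption of Table 9 can be sketched in a few lines. The critical values are the ones quoted in the caption; the function name `significance` and the dictionary layout are illustrative choices, not part of the original software.

```python
# Critical values quoted in the caption of Table 9 (alpha level -> t-table value).
T_TABLE = {0.01: 2.92, 0.05: 2.12, 0.10: 1.75}


def significance(t_statistic):
    """For each alpha level, report whether the local null hypothesis is rejected,
    i.e. whether the t-statistic exceeds the t-table value."""
    return {alpha: t_statistic > t_crit for alpha, t_crit in T_TABLE.items()}


# Row "groups 1 and 2, variable 7" of Table 9.
print(significance(2.1328))
```

Applied to the t-statistic 2.1328 this gives "no" at the 1% level and "yes" at the 5% and 10% levels, as listed in the table.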


input node i. Variable σ is the "sigmoid" function
given by

$$\sigma(x) = \frac{1}{1 + e^{-x}}$$

and is called the activation function of the neural
network.

Function g may be the same as the activation function
or may be a different function. It is important
that the activation function is nonlinear and has
bounded output.

The action of the feedforward network is determined
by its architecture (how many input, hidden,
and output nodes it has), and the values of the
weights. The numbers of input and output nodes
are determined by the application, i.e. how many
features and how many groups are within the pattern
recognition task.

The number of hidden nodes is a variable that
can be adjusted by the user. This adjustment depends
upon the experience of the user, although various
methods for setting the number of hidden
nodes, or pruning away unnecessary nodes, have
been proposed.

With the architecture given, it is then the weight
values which determine how the network performs.
The process of adjusting these weight values in order
to solve a certain pattern recognition task is
called the training of the network. The network
learns as the weight values are being modified to
achieve the training goal.

However, the weights are nothing more than a set
of parameters, determining the behavior of a particular
function.

4.2.3 Backpropagation Training Algorithm

The  starting  point  for  any  training  procedure  is 
data. 

Training data consists of input-output pairs that
have been generated by the process which the network
is to emulate. For specific inputs, the feedforward
network will produce its own set of outputs.
The difference between the network outputs and
the actual desired outputs is the "error" produced
by the network. While training the network, we
want to reduce this error to as small a value as
possible. Suppose, for simplicity, a single data
point (x, d) consisting of an input vector
$x = (x_1, \ldots, x_N)$ and a desired output vector
$d = (d_1, \ldots, d_M)$. For a given set W of weight values,
the feedforward network produces the output
vector $y(W) = (y_1(W), \ldots, y_M(W))$. One way
to express the error e(W) is:

$$e(W) = \frac{1}{2} \sum_{k=1}^{M} \left(d_k - y_k(W)\right)^2$$


For a good performance of the feedforward network
the weights W have to be chosen in such a
manner that the following condition holds:

$$e(W) \to \mathrm{Min.}$$

Notice that the actual value of Min. depends upon
the architecture of the network. Part of the art of
training is determining the smallest number of hidden
nodes that will do the job best, i.e. with the smallest
value of Min.

We have here an optimization problem which can be
treated numerically by the steepest descent
algorithm. (For networks with hidden nodes the error
function is in general not convex, so steepest descent
can only be expected to reach a local minimum.)
The iteration step can be written as:

$$W_{new} = W_{old} + \gamma \times \left(-\operatorname{grad}(e(W))\right)$$

Variable W is a vector with (N + 1)H + HM components,
grad denotes the gradient with respect to
W, and γ is the step size that controls the magnitude
of the change in each iteration.
Backpropagation training is nothing more than the
steepest descent algorithm applied to the vector of
weight values and the error function.

The notion of "backward propagation" comes
from the rather complicated expressions that arise
when one actually computes the components of
grad(e(W)) by applying the chain rule for derivatives
numerous times.

Unfortunately, steepest descent is a notoriously
slow algorithm, requiring many iterations to reach
an acceptable solution. Therefore, many improvements
have been suggested, but they are beyond our
scope here.

[Figure 5 labels: Input layer, Hidden layer, Output layer]


Figure  5:  Structure  of  the  three  layer  neural  network 
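The steepest descent iteration of section 4.2.3 can be sketched for a three layer network as follows. This is a minimal illustration under stated assumptions, not the training software used for the lecture: the output function g is taken to be the sigmoid as well, bias weights are used in the hidden layer only (so that the weight vector has (N + 1)H + HM components), and the function names, the random initialization, the step size and the data point are hypothetical.

```python
import math
import random


def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))


def init_weights(N, H, M, seed=0):
    """W1: H rows of N weights plus one bias; W2: M rows of H weights."""
    rng = random.Random(seed)
    W1 = [[rng.uniform(-1.0, 1.0) for _ in range(N + 1)] for _ in range(H)]
    W2 = [[rng.uniform(-1.0, 1.0) for _ in range(H)] for _ in range(M)]
    return W1, W2


def forward(W1, W2, x):
    # zip() stops after the N input weights; row[-1] is the bias of the hidden node.
    h = [sigmoid(sum(w * xi for w, xi in zip(row, x)) + row[-1]) for row in W1]
    y = [sigmoid(sum(w * hj for w, hj in zip(row, h))) for row in W2]
    return h, y


def error(W1, W2, x, d):
    """e(W) = 1/2 * sum_k (d_k - y_k(W))^2 for a single data point (x, d)."""
    _, y = forward(W1, W2, x)
    return 0.5 * sum((dk - yk) ** 2 for dk, yk in zip(d, y))


def gradient(W1, W2, x, d):
    """Components of grad(e(W)), obtained by repeated use of the chain rule."""
    h, y = forward(W1, W2, x)
    # Output layer: de/dy_k times the sigmoid derivative y_k (1 - y_k).
    delta_out = [-(dk - yk) * yk * (1.0 - yk) for dk, yk in zip(d, y)]
    grad_W2 = [[dk * hj for hj in h] for dk in delta_out]
    # Hidden layer: the output deltas are propagated backwards through W2.
    delta_hid = [hj * (1.0 - hj) * sum(dk * row[j] for dk, row in zip(delta_out, W2))
                 for j, hj in enumerate(h)]
    grad_W1 = [[dj * xi for xi in x] + [dj] for dj in delta_hid]
    return grad_W1, grad_W2


def descent_step(W1, W2, x, d, gamma):
    """W_new = W_old + gamma * (-grad(e(W)))."""
    g1, g2 = gradient(W1, W2, x, d)
    new_W1 = [[w - gamma * g for w, g in zip(wr, gr)] for wr, gr in zip(W1, g1)]
    new_W2 = [[w - gamma * g for w, g in zip(wr, gr)] for wr, gr in zip(W2, g2)]
    return new_W1, new_W2


# Hypothetical data point: 7 features, target "group 2 of 4".
W1, W2 = init_weights(N=7, H=5, M=4)
x = [0.2, -0.4, 0.1, 0.9, -0.7, 0.3, 0.5]
d = [0.0, 1.0, 0.0, 0.0]
e_start = error(W1, W2, x, d)
for _ in range(500):
    W1, W2 = descent_step(W1, W2, x, d, gamma=0.2)
e_end = error(W1, W2, x, d)
print(e_start, e_end)
```

A real training run would sum the error over all input-output pairs of the training set; the single data point is used here only to mirror the simplification made in the text.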


4.2.4  Example 

Here we deal with the classification of the data set
given in section 2.2 using a feedforward network.
The neural network will be trained using the given
data set with a simple rule:

    if x is in the ith group then the ith
    component of y is one and the other
    components are zero.

The output vector y is in $\mathbb{R}^4$ and the given input
vector x is in $\mathbb{R}^7$. The neural network function will
be chosen as

$$y = \sigma\left(w^{(2)} \cdot \sigma\left(w^{(1)} \cdot x + b^{(1)}\right)\right).$$

The σ function normalizes the range of output values
to the range (0, 1).

Consequently, the output activities y are easy to
interpret. Using the backpropagation training algorithm
with normalized random distributed initial
conditions the weights can be found as:

[numerical weight values missing]

Figure 6 shows the classified recognition set. The
whole data set is correctly classified. The output
activities for the prediction set are:

[numerical output activities missing]

which is completely correct.

Therefore the neural network function found by the
learning step represents a reliable pattern recognition
model for this process.


Figure  6:  Observations  vs.  output  activities  y 
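The training rule of this example (target component i is one, all others are zero) and the reading of the output activities can be sketched as follows. The helper names and the activity values in the usage line are hypothetical; choosing the group with the largest output activity is an assumed decision rule that the text does not spell out.

```python
def one_hot(group, n_groups=4):
    """Desired output vector d for an observation of the given group (1-based)."""
    return [1.0 if k == group else 0.0 for k in range(1, n_groups + 1)]


def classify(y):
    """Assign the group (1-based) whose output activity is largest."""
    return max(range(len(y)), key=lambda k: y[k]) + 1


print(one_hot(3))                         # target used while training
print(classify([0.02, 0.05, 0.91, 0.10]))  # hypothetical output activities
```

Because the sigmoid keeps every activity in (0, 1), an activity close to one in a single component and close to zero elsewhere can be read directly as a group decision.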


4.3  Fuzzy  Methods 
4.3.1  Introduction 

Fuzzy logic is a means of dealing with information
in the same way that humans or animals do,
i.e. in imprecise terms.

Fuzzy logic is built around the concept of reasoning
in degrees, rather than in boolean (yes/no, 0/1)
expressions as computers do.

Variables are defined in terms of fuzzy sets. Rules
are specified by logically combining fuzzy sets.
The combination of fuzzy sets defined for input and
output variables, together with a set of fuzzy rules
that relate one or more input fuzzy sets to an output
fuzzy set, constitutes a fuzzy system.

Fuzzy systems represent well-defined static deterministic
functions. Therefore the reaction of a fuzzy
system to inputs is anything but fuzzy. Inputs are
presented to the system as specific values, and the
fuzzy system produces a specific output value. The
operation of a fuzzy system is thus analogous to
that of conventional control systems.

Fuzzy systems are easy to design, typically require
less computer power than alternative approaches,
and provide robust operation.

However, since they can be regarded as an inductive
approach, their use in pattern recognition
applications can be quite tricky.


4.3.2  Fuzzy  Sets 


An ordinary set divides data into those items that
are completely in the set and those items that are
completely outside of the set.

We can describe this phenomenon by assigning the
value 1 to all those data which are members of the
set and the value 0 to all data which are not members
of the set. For ordinary sets, only these two
values exist. The function which assigns
these values is called the characteristic function of
the set.

Fuzzy sets allow the possibility of degrees of membership.
That is, any of the values between 0 and
1 (including 0 and 1) may be assigned. For example,
given the fuzzy set "fast cars", we may speak
of a particular car being a member of this set to degree
0.8. This would be a rather fast car, but not the
fastest car imaginable.

The function which assigns this value is called the
membership function associated with the fuzzy set.
Fuzzy membership functions are the mechanism
through which the fuzzy system interacts with the
outside world. The range, or possible output values,
of a membership function is the interval [0, 1],
the set of all real numbers between 0 and 1, inclusive.
A typical choice for a fuzzy membership
function is a piecewise linear trapezoidal function.
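A piecewise linear trapezoidal membership function can be sketched in a few lines. The breakpoints a, b, c, d and the example set used below are illustrative assumptions; they are not the membership functions drawn in Figure 8.

```python
def trapezoid(x, a, b, c, d):
    """Membership degree of x for a trapezoid with feet at a and d and
    shoulders at b and c (a < b <= c < d). The result lies in [0, 1]."""
    if x <= a or x >= d:
        return 0.0
    if x < b:
        return (x - a) / (b - a)   # rising edge
    if x <= c:
        return 1.0                 # plateau: full membership
    return (d - x) / (d - c)       # falling edge


# Hypothetical fuzzy set "Zero" centred on 0.
for value in (-0.75, 0.0, 0.75, 2.0):
    print(value, trapezoid(value, -1.0, -0.5, 0.5, 1.0))
```

For an ordinary set the same function would only ever return 0 or 1; the sloping edges are what introduce degrees of membership.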
4.3.3 Fuzzy Rules

Fuzzy rules combine two or more input fuzzy sets,
called the antecedent sets, and associate with them
an output, or consequent, set. The antecedent sets
are combined by means of operators that are analogous
to the usual logical conjunctives "and," "or,"
etc.

One method of storing and representing fuzzy rules
is through the use of a fuzzy associative memory
(FAM) matrix. Figure 7 shows an example of a
FAM matrix. In this example there are two inputs,
X and Y. Each input variable has three fuzzy sets
associated with it, which are labeled N, ZE, and
P, for "Negative," "Zero," and "Positive" (these
need not be the same for each input variable).

The output variable Z has five fuzzy sets associated
with it: NL (Negative Large), NS (Negative
Small), ZE (Zero), PS (Positive Small), and PL
(Positive Large). Each FAM matrix entry is an output
fuzzy set that is the consequent of a fuzzy rule.
For example, the shaded entry in Figure 7 represents
the rule:

    If X is Positive (P) and Y is Zero
    (ZE), then Z is Positive Small (PS).

("Zero" is a fuzzy set that would typically represent
a range of values near 0.)

FAM matrices can also have dimensions higher
than two. The number of inputs, or antecedents,
to the fuzzy rules determines the dimension. Three
inputs would result in a FAM matrix that looks like
a three-dimensional cube. Higher numbers of inputs
produce hyper-FAM matrices.

In general it is not necessary to combine all system
input variables into all of the fuzzy rules. However,
this depends upon the structure of the system.

4.3.4  Fuzzy  System  Operation 

Now there are fuzzy sets for the input and output
variables, as well as fuzzy rules. How can a defuzzified
output be obtained from a fuzzy system?
A simple method will be demonstrated for the
system associated with the FAM matrix shown
in Figure 7. In this case, there are two input
variables, X and Y, with associated fuzzy sets
N ("Negative"), ZE ("Zero"), and P ("Positive").
Figure 8 shows how the membership functions look
for these sets.

The same membership functions will be used for X
and Y. However, this is in general not necessarily
the case.

The membership functions for the fuzzy sets N,
ZE, and P are denoted by $F_N$, $F_{ZE}$, and $F_P$,
respectively. A particular value x of the input
variable X then has membership degrees $F_N(x)$,
$F_{ZE}(x)$, and $F_P(x)$.

For example, with the trapezoidal membership
functions shown in Figure 8 and a value x = 0.8,
we get:

$F_N(0.8) = 0$
$F_{ZE}(0.8) = 0.7$
$F_P(0.8) = 0.2$


The value y = -0.7 (cf. Figure 8) will give

$F_N(-0.7) = 0.13$
$F_{ZE}(-0.7) = 0.8$
$F_P(-0.7) = 0.0$


Now we present x = 0.8 and y = -0.7 to the system
as values of the input variables X and Y. The
idea is to assign a weight value to each entry in the
FAM matrix by taking the minimum of the membership
function values associated with that entry.
Consider the FAM matrix entry corresponding to
X a member of the fuzzy set ZE, and Y a member
of the fuzzy set N (cf. Figure 9). The weight $w_1$
associated with this entry is:

$$w_1 = \min\{F_{ZE}(0.8), F_N(-0.7)\} = \min\{0.7, 0.13\} = 0.13$$

Clearly, only those FAM matrix entries which have
nonzero membership-function values for both X
and Y will have nonzero weights associated with
them.

In this sense it is said that the rules corresponding
to these entries are activated.

The marked squares in Figure 9 show the four activated
rules for the values in this example. In addition
to $w_1$, there exist three more nonzero weights:

$$w_2 = \min\{F_{ZE}(0.8), F_{ZE}(-0.7)\} = \min\{0.7, 0.8\} = 0.7$$

$$w_3 = \min\{F_P(0.8), F_N(-0.7)\} = \min\{0.2, 0.13\} = 0.13$$

$$w_4 = \min\{F_P(0.8), F_{ZE}(-0.7)\} = \min\{0.2, 0.8\} = 0.2$$

The output variable Z consists of five fuzzy sets:
NL, NS, ZE, PS, and PL.

Rather than treat these as fuzzy sets, we are going to
assign specific values to them, so that NL, NS, ZE,
PS, and PL now represent specific numerical values.
This does not imply that the fuzzy system output
is limited to a finite number of values.

The output value can be computed as follows:

$$Out = \frac{w_1 NS + w_2 ZE + w_3 NS + w_4 PS}{\sum_{i=1}^{4} w_i}$$

With more general output fuzzy sets, determination
of the defuzzified output involves computation
of centroid values of (in general) overlapping
membership functions.

Figure 7: An example of a two-dimensional FAM matrix.


Figure 8: Fuzzy membership functions. The point 0.8 is a member of the fuzzy set P to degree 0.2, and a
member of the fuzzy set ZE to degree 0.7. The point -0.7 is a member of the fuzzy set N to degree 0.13
and a member of the fuzzy set ZE to degree 0.8.
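The operation described in section 4.3.4 (minimum of the membership degrees as rule weight, followed by a weighted and normalized average of the output values) can be sketched as follows. The membership degrees and the four activated FAM entries are the ones given in the text; the numerical values assigned to NL, NS, ZE, PS and PL are hypothetical, since the text does not state them, and the function names are illustrative.

```python
# Membership degrees from the worked example (x = 0.8, y = -0.7).
F_x = {"N": 0.0, "ZE": 0.7, "P": 0.2}
F_y = {"N": 0.13, "ZE": 0.8, "P": 0.0}

# Activated FAM entries: (fuzzy set of X, fuzzy set of Y) -> output fuzzy set.
FAM = {
    ("ZE", "N"): "NS",   # weight w1
    ("ZE", "ZE"): "ZE",  # weight w2
    ("P", "N"): "NS",    # weight w3
    ("P", "ZE"): "PS",   # weight w4
}

# Hypothetical numerical values for the output sets (not given in the text).
Z_VALUE = {"NL": -1.0, "NS": -0.5, "ZE": 0.0, "PS": 0.5, "PL": 1.0}


def rule_weights(f_x, f_y, fam):
    """Weight of each rule: minimum of the two antecedent membership degrees."""
    return {key: min(f_x[key[0]], f_y[key[1]]) for key in fam}


def defuzzify(weights, fam, z_value):
    """Weighted average of the output values, normalized by the sum of weights.
    At least one rule must be activated, otherwise the sum of weights is zero."""
    total = sum(weights.values())
    return sum(w * z_value[fam[key]] for key, w in weights.items()) / total


weights = rule_weights(F_x, F_y, FAM)
print(weights)
print(defuzzify(weights, FAM, Z_VALUE))
```

The computed weights reproduce $w_1 = 0.13$, $w_2 = 0.7$, $w_3 = 0.13$ and $w_4 = 0.2$; the defuzzified output itself depends on the assumed values in `Z_VALUE`.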


Figure 9: FAM matrix. The shaded entries are the activated rules


4.3.5 Example

Here we use a Sugeno-type fuzzy inference system.
The fuzzy system function will be adjusted in the
same way as shown in the preceding section:

    if the input vector x is in the ith group
    then the ith component of output vector
    y is one and the other components
    are zero.


Concerning the given data set, the output vector y
is in $\mathbb{R}^4$ and the given input vector x is in $\mathbb{R}^7$.

The first-order Sugeno fuzzy model has rules in the
following form:


    if $x_1$ is $A_1$ and ... and $x_7$ is $A_7$ then
    $z = p_1 \cdot x_1 + \ldots + p_7 \cdot x_7 + r$


where $x_i$ are the components of an input vector x,
$A_i$ are the fuzzy sets, and $p_i$ and r are constants.

Five fuzzy rules will be used to model the fuzzy input.
As membership functions the Gaussian curve
$f(x) = e^{-(x - c)^2 / (2 s^2)}$ (with center c and width s) will be employed.

The AND linkage will be defined as logical product.
The consequent term of the used rule includes
four output functions. Each output function is a
weighted linear combination of the fuzzy input.
The defuzzification is simply a weighted and normalized
average of the output functions using the
activation functions of each rule. All constants will
be calculated with the least squares method.
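The inference scheme just described (Gaussian membership functions, AND as product, linear rule outputs, weighted and normalized average) can be sketched for a single output component as follows. This is an illustration under stated assumptions, not the system used for the example: the Gaussian parametrization with center c and width s, the function names and all rule parameters are hypothetical, and the least squares estimation of the constants is not shown.

```python
import math


def gauss(x, c, s):
    """Gaussian membership function with center c and width s (assumed form)."""
    return math.exp(-((x - c) ** 2) / (2.0 * s ** 2))


def sugeno(x, rules):
    """First-order Sugeno inference for one output component.

    rules: list of (centers, widths, p, r), one entry per rule, where centers
    and widths describe one Gaussian fuzzy set per input component, and p, r
    are the constants of the linear output function z = p . x + r.
    """
    numerator, denominator = 0.0, 0.0
    for centers, widths, p, r in rules:
        beta = 1.0  # activation of the rule: AND as product of memberships
        for xi, c, s in zip(x, centers, widths):
            beta *= gauss(xi, c, s)
        z = sum(pi * xi for pi, xi in zip(p, x)) + r
        numerator += beta * z
        denominator += beta
    return numerator / denominator  # weighted, normalized average


# Two hypothetical rules on a single input variable.
rules = [
    ([-1.0], [1.0], [0.0], 0.0),  # "x is near -1" -> z = 0
    ([1.0], [1.0], [0.0], 2.0),   # "x is near +1" -> z = 2
]
print(sugeno([0.0], rules), sugeno([1.0], rules))
```

For the example in the text, four such output components (one per group) and seven input components per rule would be needed.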

As a result, each input vector (recognition set) is
correctly classified. Therefore, learning of the fuzzy
system worked quite well.

However, with the prediction set as input for the
classification system we obtain

[numerical output values missing]

which is only correct for pattern 3.

This shows that the fuzzy system function does not
represent a reliable pattern recognition model for
this process.

The fuzzy inference system studied here can therefore
only be a starting step to build a robust fuzzy
classifier. The logical linkages of each rule, the
number of rules used and the type of membership
functions must be changed to achieve correct pattern
recognition. The optimization process will
certainly need many iterations because the number
of degrees of freedom is high.

This simple example highlights the difficulties and
limitations in using and optimizing fuzzy inference
systems.

4.4 Neuro-Fuzzy Methods

4.4.1  How  it  works  . . . 

The main advantage of fuzzy systems is that a
desired system behaviour can be implemented by
simple reasoning operations. This will often provide
a solution with a small effort in terms of cost,
time, and manpower. Additionally, all engineering
knowledge can be used to improve system performance.

However, this advantage is also a limitation, because,
as in pattern recognition, knowledge has to
be extracted manually from data. This is a problem
especially with large and noisy data sets.

In contrast, because a neural net can be trained
from data, it provides a (more or less) good solution
for a given task without any help from humans.
Despite this, only a few applications exist for neural
networks in practice. This is mainly due to the
large amount of computing power needed to solve
even moderately sized classification tasks. Furthermore,
a neural net remains a black box, since the
meaning of individual weights cannot be interpreted.

This is why the idea arose to establish
a hybrid system, namely a combination of a neural
net and a fuzzy logic system. So neuro-fuzzy systems
were developed which combine the explicit
knowledge representation of fuzzy logic with the
learning power of neural nets.

The training of neuro-fuzzy systems is rather complicated
since the backpropagation algorithm cannot
be used directly, because it needs explicit differentiation
of the activation functions of the neurons. This
is, however, not possible with fuzzy logic rules,
because certain mathematical functions like minimum
and maximum are used.

The most common solution to this problem is the
use of FAM (fuzzy associative memory) matrices.
Algorithms were developed which map FAMs to
neurons.

Error backpropagation methods can therefore be
used with neuro-fuzzy systems.

4.4.2  Example 

Last, but not least, we consider the Adaptive Network
based Fuzzy Inference System (ANFIS).
ANFIS is used as learning algorithm for the
Takagi-Sugeno system which was already introduced
in section 4.3.5.

The Takagi-Sugeno fuzzy system can be rewritten
as a feedforward neural network.

Figure 10 shows it in layer structure form. The
fuzzy system function is given as:

    if $x_1$ in $A_1$ AND $x_2$ in $B_1$ then
    $y_1 = c_{1,0} + c_{1,1} x_1 + c_{1,2} x_2$


    if $x_1$ in $A_2$ AND $x_2$ in $B_2$ then
    $y_2 = c_{2,0} + c_{2,1} x_1 + c_{2,2} x_2$

with the defuzzification

$$y = \frac{\beta_1 y_1 + \beta_2 y_2}{\beta_1 + \beta_2}$$

where the activation functions of the rules are
denoted as $\beta_1$ and $\beta_2$.

The layer L5 can be defined as a single layer Adaline
network with variable weights.

The activation functions of each rule are the variable
weights. The parameters can be adjusted using
a backpropagation algorithm alone, or in combination
with a least squares method.

The learning rule will be in our case:

    if the input vector x is in the ith group
    then y presents the value i.

The group numbers are here 0 ... 3.

Figure 11 shows the structure used. This structure
consists of 18 rules with Gaussian membership
functions and linear output functions.

This structure was trained with a hybrid algorithm
of backpropagation and the least squares method.
Figure 12 shows the (correct) classification results
of the recognition set.

Also all members of the prediction set were classified
correctly:

    $y_1 = 0.00$, $y_2 = 1.01$
    $y_3 = 1.97$, $y_4 = 2.92$

Therefore the neuro-fuzzy system function is robust
and gives a perfect pattern recognition system with
our small example.
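Since the learning rule of this example lets the single output y present the group number (0 ... 3), a continuous output has to be mapped back to a group. A minimal sketch, assuming the nearest group number is chosen (the text does not state the decision rule); the four output values are the ones quoted in this section and the function name is illustrative.

```python
def nearest_group(y, lowest=0, highest=3):
    """Map a continuous network output to the nearest admissible group number."""
    return min(highest, max(lowest, int(round(y))))


outputs = [0.00, 1.01, 1.97, 2.92]
print([nearest_group(y) for y in outputs])
```

Under this assumed rule the four outputs map to the group numbers 0, 1, 2 and 3.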

5  Conclusions 

A short introduction to pattern recognition has been
presented. It has been shown that powerful methods
exist; however, care has to be taken to build robust
and consistent classifiers. The best approach
for the inexperienced user seems to be the use of
classical statistical tools, since plug and play works
in this case. Otherwise a profound background
knowledge on the behaviour of the methods used is
needed, since mis-classification or overfitting may
occur without notice.

1. SUMMARY

In identifying a target as a member of a target
class (e.g., a particular model of aircraft) one can
use the scattering signatures from a transient
type of radar (or a radar with a large number of
frequencies). There are various types of signatures
that one can use and a number of the most
common ones are discussed.

2. INTRODUCTION

One way to identify a target is to "look" at it in
the sense of a photograph so that one observes
shape, color, etc., as the relevant items for identification.
This is often referred to as imaging
and implies wavelengths short compared to dimensions
of interest on the target, and quite
small angular resolution if the target is far away.
This is an important approach to target recognition,
but not the approach considered here.
Another approach is called inverse scattering,
but this is quite complicated and is also not considered
here.

3.  SIGNATURES 

In this paper we consider identification by means
of signatures. For this we consider model-based
parameters. By a scattering model we mean
some mathematical expression with a not-too-large
set of parameters which represent the
scattering (exact or approximate) over some
region of time, frequency, etc. The parameters
may be aspect dependent or aspect independent
(more desirable). So we can define

signature type: a set of parameters associated
with a scattering model

signature: a set of specific parameter values
(including aspect dependence) associated with a
signature type and related scattering model.

A target may have more than one signature due
to the applicability of more than one scattering
model.

4. SCATTERING MODELS

There are various scattering models of interest,
including

1. Singularity expansion method (SEM)
   - poles (natural frequencies) and residues
2. Generalized cone (dilation symmetry)
3. High-frequency method (HFM)
   - asymptotic
4. Linear array of scatterers
5. Scattering centers
6. Low-frequency method (LFM)
   - electric and magnetic polarizabilities

A unifying concept for these models is symmetry,
either of the whole target or of some of its
substructures (combined with temporal isolation
(causality)).

5.  CONCLUDING  REMARKS 

There is already a large literature on this
subject. The accompanying bibliography emphasizes
the larger review papers and book
chapters where an enormous number of references
(not repeated here) can be found. One may
think of this as a metabibliography.

1. SUMMARY

Many  types  of  radar  systems  are  available  that  can 
obtain  the  band  limited  calibrated  impulse  response  of 
the  radar  target.  This  can  be  done  by  using  a  broad  band 
coherent  radar  and  then  transforming  from  the  frequency 
domain  to  the  time  domain,  or  it  can  be  done  by  actually 
transmitting  a  very  short  time  impulse  (or  other 
waveform)  and  receiving  the  voltage  versus  time 
response  of  the  signal  scattered  from  a  target  (the 
technique  shown  as  an  example  here). 

Considerable information about the radar target
electromagnetic phenomenology can be obtained by
time-frequency analysis of the resulting band limited
impulse response. Such time-frequency analysis of the
impulse response is usually at the upper limit of spectral
resolution in cases where the target has resonant effects
over the radar frequency band. In this paper, we will
discuss these time-frequency techniques, especially
including wavelet transformations, and show examples
for a specific target measured using an impulse radar.

2.  INTRODUCTION 

The ultra-wideband radar scattering from an object
contains considerable information about the
phenomenology of the target/electromagnetic
interactions. Often, the data are measured as scattering
amplitude and phase over a very wide band of frequency
increments (the frequency domain). Recently, there has
been great interest in applications involving transmission
and reception of very short impulses (0.1 to 2 ns) (or
other short-time/wide bandwidth waveforms) to obtain a
(band limited) impulse response of the target. The
author has done considerable recent research, for
example, on a class of radar systems that transmit
random electromagnetic noise and obtain the target
(band limited) impulse response as a cross correlation of the
transmitted signal and the received signal.
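The idea behind such a noise radar can be illustrated with a toy example: if the received signal is a delayed and attenuated copy of the transmitted noise, the cross correlation of the two peaks at the delay. The sketch below is a hypothetical, idealized illustration (circular delay, a single point scatterer, no additive receiver noise); the signal length, delay and gain are arbitrary choices and the function name is not taken from any radar software.

```python
import random


def circular_cross_correlation(tx, rx):
    """Cross correlation of tx and rx for all lags, with circular wrap-around."""
    n = len(tx)
    return [sum(tx[k] * rx[(k + lag) % n] for k in range(n)) for lag in range(n)]


rng = random.Random(0)
tx = [rng.gauss(0.0, 1.0) for _ in range(256)]   # transmitted random noise

# Hypothetical target: one scatterer, delay of 17 samples, amplitude 0.5.
delay, gain = 17, 0.5
rx = [gain * tx[(k - delay) % len(tx)] for k in range(len(tx))]

response = circular_cross_correlation(tx, rx)
peak_lag = max(range(len(response)), key=lambda lag: response[lag])
print(peak_lag)
```

The correlation output plays the role of the band limited impulse response: its peak sits at the delay of the scatterer and its height scales with the scattering amplitude.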

It is possible to transform from one domain to another
(FFT and IFFT), so these different measurement
techniques are equivalent as long as the frequency band
is equivalent. Extraction of target phenomenology or
radar target identification is possible by processing the
data in either the time or the frequency domain. On the
other hand, it can be shown that most scattering objects
will have different spectral characteristics at different
propagation times. The early time scattering may be
dominated by specular scattering (events narrow in time
extent) and the late time may be dominated by resonant
"ringing" (events narrow in frequency). In this sense, it
is necessary to consider such radar targets as
"non-stationary" or "dispersive" because the spectral response
is a function of propagation time. One can obtain images
(signatures or mappings) of the scattering behavior of
the target by computing time-frequency representations
of the scattering. We will discuss several algorithms for
doing this here and show examples of the resulting
response images.

For simplicity, a single example data set will be used for
the discussions here. We will use a generic missile shape
and a set of data files containing data for the scattering
of a short pulse from a mock missile shape (a cylinder
with a conic frustum shape for a nose) provided by Mr.
Jamie Henderson and Ms. J. Arthur [17]. In the
measurement range used here [17], and shown in Figure
1, a short EM impulse is propagated over a ground plane
to the missile shape. Scattering points on the mock
missile are labeled in the figure. As the pulse propagates,
it is detected by a D-dot probe (a voltage probe that
responds to the derivative of the actual E-field) before it
illuminates the missile and also after it scatters from the
missile (near-field bistatic). The scattered waveforms in
this case were sampled at a rate of 0.02 ns per sample.
The digitizer is able to sample the waveform as fast as
0.002 ns per sample. This provides information on the
waveforms with a spectral span of 25 GHz. Spectral
analysis of the data shows that the energy in the
illumination impulse and the scattered signal is limited to
less than 8 GHz, well within the capability of the
measurement system.
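The quoted spectral span is consistent with the Nyquist frequency of the sampling interval, which can be checked in a few lines. Reading "spectral span" as the Nyquist frequency 1/(2 Δt) is an interpretation, and the function name is illustrative.

```python
def nyquist_ghz(sample_interval_ns):
    """Highest unambiguous frequency in GHz for a sample spacing given in ns.
    Since 1 / ns = GHz, the Nyquist frequency is simply 1 / (2 * dt)."""
    return 1.0 / (2.0 * sample_interval_ns)


print(nyquist_ghz(0.02))    # sampling interval used for the scattered waveforms
print(nyquist_ghz(0.002))   # fastest sampling interval of the digitizer
```

With 0.02 ns per sample this gives about 25 GHz, roughly three times the 8 GHz below which the signal energy is reported to lie.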

The  missile  was  constructed  with  a  removable  probe  (a  5 
cm  rod)  on  the  nose  attached  to  a  bulkhead  coaxial 
connector.  Depending  on  the  test,  the  bulkhead 
connector  was  terminated  inside  the  missile  with  an  open 
circuit  or  a  short  circuit. 

The missile orientation is shown in the figure. It is tilted
at a 45 degree angle toward the radar. Note that there is a
Styrofoam support block not shown here (which may
cause some spurious scattering). The data files discussed
here and the configurations are summarized below.


Table 1. Missile configurations

FILE NAME   PROBE   INTERNAL
7USA3       NO      SHORT
6USA8       NO      OPEN
7USA2       YES     SHORT
7USA9       YES     OPEN

The missile is a cylinder/frustum with a removable
conductive probe on the nose. The cylinder is 118.70
cm long, the frustum 14.6 cm long. The diameter of the
cylinder and base of the frustum is 15.75 cm, tapering to 8.5
cm. The probe was 5.35 cm long. The bulkhead coaxial
connector at the tip may have a short circuit, open circuit
or matched load attached. There was also a data file
called IMPULSE2.DAT which is the signal radiated
from the TEM antenna as sensed by the D-dot probe. It
is approximately the time derivative of the field incident
upon the scatterer and can be used for calibration.


The  goal  is  to  use  time  and  frequency  based  signal 
processing  techniques  to  extract  signatures  from  the  data 
which  reveal  the  behavior  of  the  time  and  frequency 
dependent  response  of  the  missile.  A  number  of 
algorithms  were  implemented  and  used  to  analyze  the 
data  and  are  described  in  the  next  several  sections  of  this 
paper. 

3.  EVALUATION  OF  POTENTIAL  SCATTERING 
MECHANISMS 

If we consider the possible scattering mechanisms listed
below, we can estimate the propagation times and
evaluate those regions on the time plots and in the
time-frequency studies that follow.

1. Source → top of frustum → D-dot probe:
   Time = 0.0 ns (reference).

2. Source → top of frustum → D-dot probe:
   Time = -2.0 ns.

3. Source → top of frustum → base → D-dot probe:
   Time = 4.0 ns.

4. Source → top of frustum → base → frustum →
   D-dot probe: Time = 8.3 to 9.05 ns.
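The order of magnitude of the longest delay can be checked from the missile dimensions alone. The sketch below is a rough consistency check under simplifying assumptions that are not taken from the paper: the attached wave is taken to travel at the free-space speed of light, the path is the axial length of frustum plus cylinder travelled twice (top of frustum to base and back), and the bistatic geometry of source and D-dot probe is ignored.

```python
C_CM_PER_NS = 29.9792458  # speed of light in vacuum, in cm per ns


def delay_ns(path_cm):
    """Propagation delay in ns for a path length in cm at the speed of light."""
    return path_cm / C_CM_PER_NS


cylinder_cm = 118.70   # length of the cylinder
frustum_cm = 14.6      # length of the frustum

round_trip_ns = delay_ns(2.0 * (cylinder_cm + frustum_cm))
print(round(round_trip_ns, 2))
```

The result of roughly 8.9 ns falls inside the 8.3 to 9.05 ns window listed for mechanism 4; the other delays depend on the positions of source and probe and cannot be checked this way.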

Also, if we consider the fully illuminated top of the
frustum, we see that it may scatter an attached wave
traveling down the body of the missile. It will be a fairly
strong term. Thus items 3 and 4 above are likely
candidates. If we look at the actual measured data, we
clearly see 1, 2 and 4. Thus in the actual data, the "base
to D-dot probe" term is not seen. Finally, the ring-type
discontinuity that the top of the frustum presents will
scatter energy toward the D-dot probe at a set of
frequencies where the multiple paths (near side/far side
etc.) add in phase in the direction of the D-dot probe.

An estimate, based only on path distances, gives
enhanced scattering toward the D-dot probe at
approximately 2, 4, 6 and 8 GHz. When we look at the
frequency behavior of this mechanism using the
time-frequency distribution (TFD) analysis, we will see such
spectral behavior for this term.

If there are effects due to the probe attached to the top of
the frustum, we expect to see probe induced changes in
the frustum top term, or resonances. The resonances
may be due to the probe itself, or due to the cavity inside
the missile body (if the bulkhead connector is open
circuited). We can look for spectral effects near the
probe scattering term. The inside of the missile body
may also behave like a shorted waveguide. We may see
dispersive scattering due to the shorted end of this circular
waveguide for the case of the probe with the open
circuit. The time delay of this term from the direct probe
scattering term will be dispersive but will be
approximately the same as the external delay term
discussed above (Number 4 at 8 to 9 ns from the direct
top of frustum term).

4.  CALIBRATION 

The scattering data measured by the D-dot probe is the
time derivative of the actual E-field. It is necessary to
integrate the data in order to obtain a voltage waveform
which is directly proportional to the detected E-field.
Even if this is done, the resulting E-field waveform
contains time and frequency dispersive effects due to the
signal source, antenna, detector (D-dot probe), coaxial
transmission lines and sensor. We may also wish to
remove these effects.

It would be possible to remove these dispersive effects if
we had scattering data on a radar scatterer with a
theoretically known RCS (and thus radar impulse
response). To do that, we would take the raw data
scattered from the target under test (TUT), $V_{TUT}(t)$, and
the raw data scattered from the calibration target, $V_{C}(t)$,
and integrate. This would give $h_{TUT}(t)$ and $h_{C}(t)$, the
voltage waveforms proportional to the scattered E-fields
for the target under test and the calibration target. At this
point, we would take the Fourier transform of $h_{TUT}(t)$ and
$h_{C}(t)$ to obtain $S_{TUT}(\omega)$ and $S_{C}(\omega)$, the frequency spectra
proportional to the scattered E-fields. Finally, we would
compute $S_{cal}(\omega)$, the calibrated scattered field in units of
$\sqrt{\text{area}}$ (or $\sqrt{\text{RCS}}$):

$$S_{cal}(\omega) = S_{TUT}(\omega)\,\frac{S_{exact}(\omega)}{S_{C}(\omega)}$$

where $S_{exact}(\omega)$ denotes the theoretically known response
of the calibration target. We would then have the square
root of RCS (at this point a complex number) of the
target as a function of frequency.

Now at this point, a large part of the calibrated spectrum
would still be incorrect. This is because over a large part
of the spectrum, the spectral energy will be very low
because the system cannot produce or radiate energy in
these frequency bands. Typically, only the band where
the system efficiently produces and radiates energy will
be valid. This region can be determined by integrating
the D-dot data and transforming it to the frequency
domain. Only the region where the spectrum is clearly
non-zero will be valid. Outside of this band, the
denominator in the calibration equation is very small and
the result is dominated by measurement noise. We set the
unreliable low frequency data points to zero and throw
away the unreliable high frequency data points. (Danger:
do not throw away the low frequency part of the array; we
will need this string of zeros extending from the first
useful frequency point to the DC value later.) Finally, if
we wish, we may take the inverse Fourier transform of this
final set of data to yield the calibrated impulse response
of the target under test.
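
The calibration steps described above can be sketched as follows. This is a minimal illustration, not the processing actually used for the measurements: the function name `calibrate`, the argument names, the simple running-sum integration, and the assumption that the theoretical response `s_exact` is supplied on the same frequency grid as the FFT are all hypothetical. The valid band edges `f_lo` and `f_hi` are assumed to be chosen by inspecting the integrated D-dot spectrum.

```python
import numpy as np


def calibrate(v_tut, v_cal, s_exact, dt, f_lo, f_hi):
    """Sketch of the calibration ratio with band limiting.

    v_tut, v_cal : raw (derivative) records for the target under test
                   and the calibration target, same length.
    s_exact      : assumed known response of the calibration target on
                   the rfft frequency grid (hypothetical input).
    dt           : sampling increment (ns gives frequencies in GHz).
    f_lo, f_hi   : edges of the band judged to be reliable.
    """
    v_tut = np.asarray(v_tut, dtype=float)
    v_cal = np.asarray(v_cal, dtype=float)
    s_exact = np.asarray(s_exact, dtype=complex)

    # Integrate the derivative data (simple running sum).
    h_tut = np.cumsum(v_tut) * dt
    h_cal = np.cumsum(v_cal) * dt

    # Transform to the frequency domain.
    freqs = np.fft.rfftfreq(len(h_tut), d=dt)
    s_tut = np.fft.rfft(h_tut)
    s_ref = np.fft.rfft(h_cal)

    # Apply the ratio only inside the reliable band; everything below
    # f_lo stays at zero so the array still runs down to DC.
    valid = (freqs >= f_lo) & (freqs <= f_hi)
    s_out = np.zeros_like(s_tut)
    s_out[valid] = s_tut[valid] * s_exact[valid] / s_ref[valid]

    # Discard the unreliable high-frequency points.
    keep = freqs <= f_hi
    return freqs[keep], s_out[keep]
```

Keeping the leading zeros means the returned array can later be passed to an inverse FFT without re-indexing the frequency axis.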

Unfortunately,  in  the  available  data  set  in  this  example, 
we  do  not  have  scattered  data  from  a  calibration 
reference.  We  do  have  data  on  the  incident  E-field  as 
measured  by  the  D-dot  probe.  If  we  assume  that  the  data 
from  the  D-dot  probe  is  flat,  we  can  perform  a 
transformation  of  the  data  which  will  permit  the  relative 
behavior  of  the  scattered  field  to  be  determined.  In  this 
case,  we  integrate  both  the  incident  voltage  waveform 
and  the  scattered  waveform.  We  have  done  this,  and 
some  of  the  results  will  be  shown  in  the  next  section. 

Having said all this, it is important to make an
observation about the Fourier transform. We know that
the Fourier transform of the integral of any time series is
simply 1/(jω) times the Fourier transform of that time
series. Thus the process of taking the integral simply
introduces a spectral roll-off in the result as a function of
frequency. If we do this, we note that at 8 GHz the raw
data PSD is 17 dB stronger than the integrated data PSD.
Also, given the type of data measured here, I believe that
we can trust the dynamic range to extend to
approximately 35 dB. So, if -35 dB is used as a criterion,
the raw data has an upper frequency limit of
approximately 9.5 GHz, while the integrated waveform
has an upper frequency limit of 8 GHz.
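
The roll-off introduced by integration can be written out explicitly. The relations below are the standard Fourier integration property for a zero-mean record (the DC term is ignored); the symbols $x$, $X$, $f_1$ and $f_2$ are introduced here only for illustration.

```latex
\mathcal{F}\left\{\int_{-\infty}^{t} x(\tau)\,d\tau\right\}
  = \frac{X(\omega)}{j\omega}
\quad\Longrightarrow\quad
\frac{\mathrm{PSD}_{\mathrm{raw}}(\omega)}{\mathrm{PSD}_{\mathrm{int}}(\omega)}
  = \omega^{2},
\qquad
10\log_{10}\frac{\omega_2^{2}}{\omega_1^{2}}
  = 20\log_{10}\frac{f_2}{f_1}\ \mathrm{dB}
```

In other words, the integrated spectrum falls relative to the raw spectrum by 20 dB per decade (about 6 dB per octave) of frequency.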

For  the  studies  to  be  done  here,  rapid  frequency  rolloff  is 
a  disadvantage.  We  want  to  observe  the  spectral 
response  of  resonant  structures  on  a  test  target  (the 
missile  shape).  This  integration  process  will  actually 
suppress  these  effects.  We  have  done  studies  both  with 
and  without  integration  and  have  shown  that  the  resonant 
effects  that  we  wish  to  see  are  more  clearly  revealed  if 
the integration process is not done. Note also that the
PSD for the raw data is flatter (has less rapid rolloff)
than when the integration is done. For these reasons, we
will show most of the spectral responses in the following
section by processing only the raw data. The only
penalty will be that the impulse response that we get will
not be calibrated. The higher frequencies will be
enhanced in our work.

5.  SHORT  TIME  FOURIER  TRANSFORM 
ANALYSIS 

5.1  The  Short  Time  Fourier  Transform  Algorithm 
Much  of  this  work  is  concerned  with  the  behavior  of  the 
spectrum  of  the  mock  missile  as  a  function  of  time  delay. 
This  analysis  will  attempt  to  reveal  the  specific  onset  of 
resonance  effects  as  is  expected  from  a  resonant 
scatterer.  The  analysis  will  consist  of  forms  of  time 
frequency representations (TFRs) or time frequency
distributions (TFDs) [12, 14 to 20, 22 to 24]. These are
representations  or  mappings  of  the  spectral  behavior  of 
the  target  scattering  as  a  function  of  time.  The  most 
intuitive  of  these  algorithms  is  the  short  time  Fourier 
transform  (STFT).  In  this  algorithm  (in  our  case),  the 
time  domain  data  are  broken  into  overlapping  segments 
(a  rectangular  window)  and  each  segment  is  then 
Fourier  transformed  into  the  frequency  domain  and  the 
amplitude  squared  taken  to  form  the  power  density 
versus  frequency  as  a  function  of  time.  We  then  plot  the 
data  as  a  two  dimensional  color  mapping  with  frequency 
vertical  and  time  horizontal.  The  color  scale  for  the 
image  is  scaled  to  the  radar  scattering  power  at  each 
particular  frequency  and  time.  The  algorithm  is  sketched 
in  Figure  2. 
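
The STFT procedure described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the code used to produce the figures: the 50-point segment length and the 0.02 ns sampling increment are taken from later in the text, while the hop size, the zero-padded FFT length and the function name `stft_power` are hypothetical choices.

```python
import numpy as np


def stft_power(x, dt, seg_len=50, hop=1, nfft=512):
    """Rectangular-window STFT power map.

    Returns (times, freqs, power), where power[i, j] is the squared
    spectral magnitude at freqs[i] for the segment centred at times[j].
    """
    x = np.asarray(x, dtype=float)
    if len(x) < seg_len:
        raise ValueError("record is shorter than one segment")

    # Start index of each overlapping segment.
    starts = np.arange(0, len(x) - seg_len + 1, hop)

    # Slicing the record is the rectangular window; no taper is applied.
    frames = np.stack([x[s:s + seg_len] for s in starts])

    # Zero-padded FFT of each segment, then amplitude squared.
    spectra = np.fft.rfft(frames, n=nfft, axis=1)
    power = np.abs(spectra) ** 2

    freqs = np.fft.rfftfreq(nfft, d=dt)       # GHz when dt is in ns
    times = (starts + seg_len / 2.0) * dt     # segment centres
    return times, freqs, power.T
```

The returned `power` array can be displayed as an image with frequency on the vertical axis and time on the horizontal axis. With a 50-point segment and a 0.02 ns increment, the first segment is centred at 0.5 ns.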

We  have  applied  this  algorithm,  and  a  data  subtraction 
version  (target  with  probe  minus  target  without  probe)  to 
each  of  the  data  files.  The  results  are  given  in  the  next 
section. 

5.2  STFT  Applied  to  the  Tilted  Missile 

If  we  apply  the  STFT  to  the  45  degree  tilt  case,  we  can 
look  for  interaction  terms  and  resonance  terms  at 
different times. We have data with and without the 5 cm
probe and for the cases of a short circuit on the
connector (on the inside) and an open circuit.

Consider first the 45 deg. tilt missile with no probe and
with a short circuit on the bulkhead connector inside the
missile (7USA3) as shown in Figure 3. Note that
distinct responses are found at approximately 2.3 ns, 7.3 ns
and 11.0 ns. These responses correspond to the STFT
responses (shifted 2 ns earlier as mentioned above)
listed below.


TIME      SPECTRAL RESPONSE                   MECHANISM TIMING

0.3 ns    1.6 GHz response                    top and side of frustum
5.2 ns    2.4 to 0.4 GHz response             top to base to D-dot
9.0 ns    6.4 GHz to DC resp. (dispersive)    top to base to top to D-dot

The mechanisms were previously listed in section 3.

Also note the small response at approximately 5.6 GHz
at 0.3 ns (STFT). Perhaps this is a higher order term
(double bounce) related to the strong 1.6 GHz effect at
this time. The triple bounce term at 9.0 ns in the STFT
display is particularly interesting because of its
dispersive nature (frequency response is a function of
delay time). Since there is no probe on the missile, none
of this behavior is due to the probe.

Next,  we  applied  the  same  algorithm  to  7USA2,  as 
shown  in  Figure  4.  In  this  case,  there  is  a  probe  on  the 
tip  of  the  missile,  but  the  inside  is  terminated  in  a  short 
circuit,  so  there  can  be  no  internal  effects.  We  also 
applied  the  STFT  algorithm  to  6USA9  in  Figure  5, 
where  the  inside  of  the  bulkhead  connector  is  open 
circuited.  In  this  case,  there  is  an  opportunity  for  some 
signal  propagation  and  resonances  due  to  the  inside  of 
the  missile  case.  Unfortunately,  there  are  no  obvious 
differences  in  these  three  figures.  The  addition  of  the 
probe  causes  no  strong  effect  in  the  data. 

We  also  applied  the  STFT  algorithm  to  the  response  data 
after  integration.  The  result  should  be  more  truthful  in 
the  sense  that  the  roll-off  in  frequency  should  be 
included.  The  result,  shown  in  Figure  6  for  6USA9  (with 
probe  and  open  circuit),  shows  nearly  equal  spectral 
energy  levels  at  the  three  times  identified  here.  The 
frequency  coverage  is  also  nearly  the  same  (approx.  5 
GHz  to  DC  as  seen  in  the  figure). 

In  order  to  look  into  the  details  of  the  scattering 
differences  between  the  probe  and  the  no-probe  case,  we 
also  performed  pair  subtraction  prior  to  STFT  analysis. 
An  example  where  the  data  for  the  missile  with  no  probe 
and  a  short  circuit  (7USA3)  is  subtracted  from  the 
scattering  data  for  the  missile  with  a  probe  and  an  open 
circuit  (6USA9)  is  shown  in  Figure  7.  Note  in  the  top 
plot  that  the  strong  isolated  responses  seen  in  the 
previous  figures  have  been  suppressed. 

The initial scattering term (the frustum scattering)
shows a dispersive and strong term shifting from 2.5
GHz to 1.8 GHz over the time frame from 0.2 ns to 2.5
ns. There are also response terms at 4.2 GHz and 6 GHz
(also somewhat dispersive). The scattering at 5.2 ns
(frustum to base to probe) extends from 6.5 GHz to DC
and is somewhat dispersive also.

The scattering at 9 ns (frustum-base-frustum-D-dot
probe) is now very extended in frequency. The strongest
response  is  at  2.7  GHz,  but  there  are  other  terms  at  5  and 
7  GHz  and  above.  These  terms  are  not  dispersive.  This 
plot  indicates  that  the  double  and  triple  diffraction 
mechanisms  are  quite  strong.  Their  behavior  can  be 
extracted  using  the  STFT  processing. 

An important question is whether the multiple terms in
Figure 7 at 9 ns are caused by multiple scattering of
attached modes on the outside of the skin of the missile
or if they represent terms that couple to the inside of the
missile shape via the open circuit and probe and travel
to the base and back on the inside of the missile. Both
cases would imply scattering from the neighborhood of
the frustum/probe to the base and back to the
frustum/probe and then to the D-dot probe. One way of
determining  the  difference  is  demonstrated  in  Figure  8. 
In  this  figure  we  subtract  data  for  the  case  of  the  probe 
with  the  short  circuit  on  the  inside  (7USA2)  from  the 
data  for  the  missile  with  the  probe  with  an  open  circuit 
on  the  inside  (6USA9).  The  only  difference  in  the  two 
cases  is  the  open  and  short  circuit  on  the  inside  of  the  5 
cm  probe  on  the  nose.  Note  in  the  figure  the  strong 
response  at  9  ns.  This  is  the  time  that  corresponds  to  a 
signal  traveling  from  the  tip  to  the  base  and  back  to  the 
tip  where  it  re-radiates  to  the  D-dot  probe.  We  can 
speculate  that  the  strong  multi-frequency  response  must 
be  due  to  internal  propagation.  Unfortunately,  the 
change  from  the  short  to  the  open  circuit  also  changes 
the impedance and resonant behavior of the 5 cm probe
itself. Thus even though the signal may travel on the
outside  of  the  missile,  the  change  in  the  5  cm  probe 
impedance  would  change  the  behavior  of  the  signal  from 
one  data  set  to  the  other.  That  could  account  for  the 
response  we  see  at  9  ns  in  Figure  8  also. 

Figure  9  is  an  attempt  to  resolve  this  question  using  the 
data  provided.  In  this  case,  we  take  the  difference  of  two 
sets  of  data,  one  with  a  5  cm  probe  on  the  tip  (7USA2) 
and  one  without  (7USA3).  Both  of  these  missile  shapes 
have  a  short  circuit  on  the  bulkhead  on  the  inside  and 
thus  prevent  any  signal  from  propagating  inside  the 
missile. Note that there is a strong response at 1.6 GHz at
1.5 ns which would correspond to a direct probe effect.
On  the  other  hand,  there  is  no  strong  response  at  9  ns, 
which  would  be  a  tip  to  base  to  tip  to  D-dot  probe  term. 
This  implies  that  the  external  signal  mode  is  very  small 
and  that  the  response  shown  in  Figure  8  at  9  ns  is  due  to 
internal  propagating  signals. 

This  is  still  not  a  definitive  proof  of  internal  propagating 
modes.  To  help  distinguish  internal  and  external  modes, 
one  may  put  a  baffle  plate  at  some  distance  from  the 
base  of  the  missile  shape  on  the  inside.  This  would 
change  the  timing  and  resonant  behavior  of  the  internal 
modes  while  leaving  the  external  modes  unchanged. 

We have also applied the STFT analysis to the integrated
data set corresponding to Figure 7. The result is shown
in Figure 10. Once again, we are subtracting the missile
shape with no probe and an internal short (7USA3) from
the case where there is a 5 cm probe on the tip and an
open circuit on the inside (6USA9). (Remember that the
color scale is now logarithmic.) The response shows
general behavior similar to that given in Figure 7.

6. WIGNER ANALYSIS

6.1.  The  Wigner  Algorithm 

The  short  time  Fourier  transform  algorithm  has  been 
used  to  find  the  frequency  spectral  behavior  of  the 
scattering  from  the  missile  shape  as  a  function  of  time. 
This  algorithm  uses  a  fixed  time  window  for  all 
frequencies,  and  thus  has  a  reduction  in  frequency 
resolution  as  the  frequency  band  is  lower.  Thus  a  major 
limitation  is  in  finding  an  appropriate  window  for  the 
tradeoff  between  time  and  frequency  resolution.  In  many 
applications,  this  is  undesirable  because  the  requirement 
is  that  the  spectral  resolution  must  be  as  high  as  possible 
over  the  signal  processing  domain. 

This  problem  is  addressed  directly  by  considering  the 
energy  distribution  versus  the  variables  of  time  and 
frequency.  A  non-linear  phase  space  energy  distribution 
approach  was  originally  developed  by  Wigner  [1]  in  a 
quantum  mechanics  context.  The  theory  is  developed  in 
several  references  [1,2  and  3]  and  in  the  Moghaddar 
Ph.D.  dissertation,  [24],  and  will  not  be  re-derived  here. 

The  Wigner  distribution,  although  non-linear,  satisfies 
the  condition  that  a  summation  over  frequency  (or  time) 
at  a  particular  time  (or  frequency)  yields  the  total  energy 
at  that  time  (or  frequency).  This  is  called  satisfying  the 
marginals. 

The  most  important  property  of  the  Wigner  distribution 
is  that  it  can  be  shown  that  it  has  the  highest  signal 
concentration  in  the  time-frequency  plane.  As  a  serious 
disadvantage,  however,  since  it  is  a  non-linear  technique 
it  generates  spurious  cross-terms  in  that  plane.  This 
means  that  interpretation  can  be  very  difficult.  This  is 
why  we  have  chosen  to  perform  the  STFT  first  and  to 
attempt  interpretation  of  the  STFT  data  first  in  this  study. 

The  algorithm  [following  Moghaddar]  is  given  for  a 
discrete-frequency  signal  S(n)  as 


$$W(t,n) = \sum_{k=-2N}^{2N} \cdots$$


where: 

n  is  the  discrete  variable 
f  is  the  frequency 
t  is  the  continuous  variable 


In this particular work, the discrete signal is the
electromagnetic impulse response of the target, sampled
in time. Thus we exchange the roles of time and
frequency in the above equation and remember that the
impulse response is real-valued rather than complex.
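
For readers who want to experiment, the sketch below implements the standard textbook discrete Wigner-Ville distribution for a real time series. It is not claimed to be the exact expression from the Moghaddar dissertation. Converting the real record to an analytic signal first is an assumption made here to avoid aliasing between positive and negative frequencies; the function names are hypothetical.

```python
import numpy as np


def analytic_signal(x):
    """Analytic signal via the FFT (negative frequencies removed)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    spectrum = np.fft.fft(x)
    gain = np.zeros(n)
    gain[0] = 1.0
    if n % 2 == 0:
        gain[n // 2] = 1.0
        gain[1:n // 2] = 2.0
    else:
        gain[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * gain)


def wigner_ville(x, dt):
    """Discrete Wigner-Ville distribution of a real record.

    Returns (times, freqs, wvd) with wvd[i, j] the value at freqs[i]
    and times[j].
    """
    z = analytic_signal(x)
    n = len(z)
    wvd = np.zeros((n, n))
    for i in range(n):
        # Largest symmetric lag that stays inside the record.
        k_max = min(i, n - 1 - i)
        k = np.arange(-k_max, k_max + 1)
        # Instantaneous autocorrelation z[i+k] * conj(z[i-k]).
        kernel = np.zeros(n, dtype=complex)
        kernel[k % n] = z[i + k] * np.conj(z[i - k])
        # The kernel is conjugate-symmetric in k, so its FFT is real.
        wvd[:, i] = np.fft.fft(kernel).real
    # The lag step is 2*dt, hence the factor of two in the frequency axis.
    freqs = np.arange(n) / (2.0 * n * dt)
    times = np.arange(n) * dt
    return times, freqs, wvd
```

Because the kernel is a product of the signal with itself, a record containing two components also produces an oscillating term midway between them. These are the cross-terms discussed above.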

6.2.  The  Wigner  Algorithm  Applied  to  the  Tilted 
Missile 

We can demonstrate the Wigner algorithm using the 45
degree missile orientation data. Examples are shown in
Figure 10 which can be compared with Figure 3
(7USA3; 45 deg; no probe, short inside). The STFT
shows the initial response (0.5 ns) to be a single diffuse
return at 1.5 GHz or so, while the Wigner (4.5 ns)
resolves the return into 3 or 4 individual terms. There is
a return 5 ns later in both cases, but the Wigner resolves
it into two terms. Finally, at 9 ns later, both algorithms
show a term which lowers in frequency in the later (0.5
ns) phase, but the Wigner shows another low frequency
term. Both techniques reveal a term near 6 GHz at 9 ns
(identified as a delay corresponding to "top to base to
D-dot" in section 5.2) from the initial return, but once again
the Wigner resolves it into two terms approximately 0.4
ns apart. This 0.4 ns corresponds to a two-way delay over
a spacing of 6 cm on the target. This is a
reasonable spacing across the 8.5 cm (tilted) top of the
frustum, or it may correspond to an extra delay based on
the diagonal extent of the double bounce.
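
The conversion from the 0.4 ns separation to a spacing on the target is a simple two-way path calculation, assuming free-space propagation velocity (the symbols $d$ and $\Delta t$ are introduced here for illustration):

```latex
d = \frac{c\,\Delta t}{2}
  = \frac{(29.98\ \mathrm{cm/ns})(0.4\ \mathrm{ns})}{2}
  \approx 6.0\ \mathrm{cm}
```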

We  can  also  compare  the  Wigner  for  7USA2  (45  deg.; 
probe;  short  inside)  shown  in  Figure  11.  We  expect  to 
see  a  change  in  those  terms  where  the  probe  influences 
the  result.  In  fact,  at  this  angle,  we  expect  all  of  the  terms 
to  involve  the  top  of  the  missile  and  thus  can  look  for 
effects in each mechanism. A careful comparison shows
some differences in the later time of the cluster of terms
between 4.5 and 5.6 ns (i.e., the term near 5.6 ns). We
also see small terms near 14.5 ns in Figure 4 (with
probe) that are not seen in Figure 10 (no probe). These
may be caused by the addition of the probe on the tip.
Note the higher resolution of the scattering terms. Also
note the terms near 7, 11 and 17 ns. These are due to
interactions that are the result of the non-linear nature of
the Wigner algorithm. (There are ways to reduce this
effect, but they are not necessary here.)

We  also  applied  the  Wigner  algorithm  to  6USA9  (45 
deg;  probe;  open  inside)  as  seen  in  Figure  12.  In  this 
case,  the  only  change  between  this  figure  and  Figure  4  is 
the  change  on  the  inside  of  the  bulkhead  connector  from 
a short circuit to an open circuit. There are some
changes near 5.9 ns (the later part of the initial term,
tentatively identified as scattering from the frustum
area directly to the D-dot probe). These changes are not
specific enough to be unambiguously identified as
caused by the change from the short to the open circuit.

Once  again,  it  is  seen  that  the  resolution  in  the  time- 
frequency  domain  is  better  for  the  Wigner  than  for  the 
STFT. The identification of specific terms is still
difficult because the experimental set-up was not
designed specifically for this type of analysis.

6.3.  The  Wigner  Algorithm;  Summary 

We  have  shown  the  use  of  the  Wigner  time-frequency 
algorithm  to  evaluate  the  impulse  scattering  from  the 
missile shape. The algorithm has been shown to have
higher resolution than the STFT. The interaction terms
were not a problem in this application, mostly because
the STFT resolution was sufficient to resolve most of the
interaction ambiguities. It is nevertheless possible that
some of the fine-grain behavior that we discussed was
actually some type of interaction.

The  experiments  could  be  modified  in  the  future  to 
specifically  reveal  some  of  the  specific  individual 
mechanisms in the interaction. Very careful (internal
motor driven) probe extension as data are taken is one
example.

7.  WAVELET  ANALYSIS 
7.1.  Wavelet  Analysis 

It  is  well-known  that  it  is  impossible  to  perform  spectral 
analysis  with  infinite  resolution  in  the  time  and 
frequency  domain  simultaneously.  (We  might  have  a 
deterministic  formula  for  the  time  domain  behavior,  for 
example,  and  then  we  could  get  infinite  resolution  in  the 
frequency  domain.  This  is  what  happens  with  model- 
based  superresolution  techniques.)  The  most  common 
spectral  analysis  technique  (the  Fourier  technique)  is 
actually  a  type  of  convolution  between  finite  data  sets 
and  sinusoids.  (Windowing  techniques  make  up  for 
some  of  the  flaws  of  this  approach.)  The  limits  of  the 
resolution  are  fundamentally  due  to  the  finite  nature  of 
the  data  sets  and  the  convolution  process. 


Wavelet spectral analysis recognizes immediately that
the data set to be processed is to be convolved with a
specific waveform (the wavelet), and specifies a
resolution in frequency and in time by the careful design
of the wavelet [13, 14, 16, 20, 21, 22 and 24]. The result
is a time-frequency domain representation of the
waveform where the resolution is optimized over the
domain of time and frequency. Entire books have been
written on the wavelet transform, and thus a full
discussion is beyond the scope of this paper. See the
very nice survey article by Rioul and Vetterli in the
IEEE Signal Processing Magazine [13], or try the book
A Friendly Guide to Wavelets by G. Kaiser [21]. There
are also a number of world wide web pages on the
Internet where more background can be found.

Consider the short time Fourier transform (STFT)
discussed earlier in this report. In this case, a sinusoid is
windowed to produce a finite time duration sinusoid
which is then convolved with the waveform. Actual
implementation may change the order of the process, but
we must have a triple product of the original waveform
to be evaluated, the window and the sinusoid. In the
STFT, note that the window size is fixed as a function of
frequency and time. Thus as the sinusoid is scanned over
frequency, the number of cycles in the window varies.
Thus the resolution of the transform varies as a function
of frequency. One way to correct for this (if we want to)
is by creating a window which always contains the same
number of cycles. This means that we can simply
modify the STFT so that the window is a function of
frequency and then perform the convolution. The result
is a very useful type of wavelet. In fact, the STFT and
the wavelet transform can be generalized as simply two
types of cross-ambiguity functions and they have the
same general properties [13].

Before we get into the application of the wavelet
transform here, the important concept of "scale" must be
discussed. The development of the wavelet transform
leads to issues of orthogonality, completeness and
energy conservation (of course). The result is that the
natural unit to use instead of frequency is "scale." This
is because equal increments of frequency would produce
variations in the resolution and thus we would move
from an optimal frequency domain sampling to
over-sampling or under-sampling. The only scheme which
works is a uniform "scale" axis. (Refer to the
references.) Thus the plots given here will be given in
units of "scale" versus time.

We will use the discrete wavelet transform (DWT) here.
Simply put, we use a prototype waveform (a "mother
wavelet") and convolve it with the D-dot probe time
domain waveform. All other wavelets are stretched or
compressed versions of the prototype. The wavelet
prototype $h_a(t)$ is given as

$$h_a(t) = \frac{1}{\sqrt{|a|}}\, h\!\left(\frac{t}{a}\right)$$

where $a$ is the "scale factor." Thus the definition of the
DWT is

$$\mathrm{DWT}(t,a) = \sum_{\tau} x(\tau)\, h_a(\tau - t)\, \Delta\tau$$

Note that variations in time and scale are imbedded in
this convolution. Also note the inefficiency of this
calculation because the tricks used in the "fast" Fourier
transformation are not used. (A "fast" DWT is available,
however; see [21].)

With this version of the wavelet transform, we can form
a specific relationship between frequency and scale. In
this particular case, it is

$$\text{scale} = (f/F)^{0.25}$$

where F is the folding frequency and F = 1/(2δ), where δ is
the sampling time increment. In this case, δ = 0.02 ns,
and thus F = +/- 25 GHz.
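
The idea of a window that always contains the same number of cycles can be sketched directly. The code below is an illustrative constant-cycle transform, not the implementation used for the figures: the Hann taper, the choice of five cycles per window, the unit-energy normalization and the function name `constant_cycle_transform` are all assumptions made here.

```python
import numpy as np


def constant_cycle_transform(x, dt, freqs, n_cycles=5.0):
    """Power map from windowed sinusoids with a fixed number of cycles.

    Returns power[i, j] for analysis frequency freqs[i] at sample j.
    """
    x = np.asarray(x, dtype=float)
    power = np.zeros((len(freqs), len(x)))
    for i, f in enumerate(freqs):
        # Window half-length chosen so the window spans n_cycles periods.
        half = max(1, int(round(n_cycles / (2.0 * f * dt))))
        if 2 * half + 1 > len(x):
            raise ValueError("window longer than the record at this frequency")
        tau = np.arange(-half, half + 1) * dt
        # Tapered complex sinusoid: short at high f, long at low f.
        kernel = np.hanning(len(tau)) * np.exp(2j * np.pi * f * tau)
        kernel /= np.sqrt(np.sum(np.abs(kernel) ** 2))   # unit energy
        power[i] = np.abs(np.convolve(x, kernel, mode="same")) ** 2
    return power
```

Unlike the fixed 50-point STFT segment, the kernel here shrinks as the analysis frequency rises, so time resolution improves at high frequency at the cost of absolute frequency resolution.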

An  example  data  file  was  created  to  demonstrate  the 
behavior  of  the  wavelet  transform.  The  test  waveform  is 
the  sum  of  the  following  four  signals. 

1. An impulse (a single data point) of amplitude three
at 1.1 ns.

2.  A  sinusoid  of  amplitude  1  and  frequency  2.3  GHz 
starting  at  4.3  ns. 

3.  A  sinusoid  of  amplitude  1  and  frequency  6.7  GHz 
starting  at  9.7  ns. 

4.  A  sinusoid  of  amplitude  1  and  frequency  12.4  GHz 
starting  at  12.4  ns. 
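
A test waveform of this kind is easy to regenerate. The sketch below builds the four components listed above on a 0.02 ns grid. The record length of 1000 samples, the zero starting phase of each sinusoid, and the assumption that each sinusoid continues to the end of the record are not given in the text and are chosen here only for illustration.

```python
import numpy as np


def make_test_waveform(n_samples=1000, dt=0.02):
    """Impulse plus three gated sinusoids (times in ns, frequencies in GHz)."""
    t = np.arange(n_samples) * dt
    x = np.zeros(n_samples)

    # 1. Single-sample impulse of amplitude three at 1.1 ns.
    x[int(round(1.1 / dt))] = 3.0

    # 2-4. Unit-amplitude sinusoids, each switched on at its start time.
    for f_ghz, t_start in ((2.3, 4.3), (6.7, 9.7), (12.4, 12.4)):
        i0 = int(round(t_start / dt))
        elapsed = np.arange(n_samples - i0) * dt
        x[i0:] += np.sin(2.0 * np.pi * f_ghz * elapsed)
    return t, x
```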

If  we  evaluate  this  waveform  using  the  STFT,  we  get  the 
result  shown  in  Figure  13.  Note  that  the  50  data  point 
span  is  such  that  the  start  time  of  the  STFT  response  is  at 
0.5  ns.  This  is  early  enough  to  capture  the  single  point 
event,  but  there  is  no  effect  that  can  be  seen.  The  first 
waveform  begins  at  4.3  ns  and  has  frequency  2.3  GHz, 
and  the  second  waveform  begins  at  9.7  ns  and  has 
frequency  6.7  GHz  as  expected.  The  response  images 
for  these  waveforms  are  very  clear.  The  third  waveform 
is  above  the  top  of  the  plot.  This  is  because  we  stopped 
plotting  the  response  at  9  GHz  because  the  impulse  radar 
has  an  upper  frequency  limit  of  approximately  8  GHz, 
and  we  wish  to  see  the  details  of  the  response  below  8 
GHz. 

The  wavelet  analysis  of  this  waveform  is  shown  in 
Figure  14.  In  this  figure,  the  impulse  can  be  seen  in  the 
upper plot. We see the wavelet response of the impulse
at 1.1 ns, where the STFT was not able to resolve it.

Note that most of the energy is at the high frequency
limit of the analysis, as we expect for the very short time
impulse modeled here. Next, we see the three sinusoids
starting at 4.3, 9.7 and 12.4 ns. They have scale factors
of 0.55 (2.3 GHz), 0.72 (6.7 GHz) and 0.84 (12.4
GHz). Note that all three waveforms are shown. In
fact, the scale from 0 to 1 is the entire frequency span
(0 to 1/(2t), where t is the time increment).
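
The scale factors quoted above are consistent with a fourth-root mapping between frequency and scale, with a folding frequency of 25 GHz. The short sketch below checks this; the function names are hypothetical and the exponent of 0.25 is inferred from the quoted values.

```python
F_FOLD_GHZ = 25.0  # folding frequency, 1 / (2 * 0.02 ns)


def freq_to_scale(f_ghz, f_fold=F_FOLD_GHZ):
    """Map frequency (GHz) to the 0-to-1 scale axis."""
    return (f_ghz / f_fold) ** 0.25


def scale_to_freq(scale, f_fold=F_FOLD_GHZ):
    """Inverse mapping from scale back to frequency (GHz)."""
    return f_fold * scale ** 4


for f in (2.3, 6.7, 12.4):
    print(f, round(freq_to_scale(f), 2))   # 0.55, 0.72, 0.84
print(scale_to_freq(0.5))                  # 1.5625
```

The last line reproduces the correspondence between 0.5 scale and roughly 1.56 GHz.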

7.2.  Wavelet  Analysis  Applied  to  the  Tilted  Missile 
The  application  of  the  Continuous  Wavelet  Transform 
(CWT)  to  the  45  degree  orientation  data  is  shown  in 
Figures  15  and  16.  The  configuration  is  summarized 
below. 

• Figure 15: 7USA3; no probe; internal short

• not shown here: 7USA8; no probe; internal open

• not shown here: 7USA2; probe; internal short

• Figure 16: 6USA9; probe; internal open

Careful  observation  of  these  plots  does  not  show  any 
obvious  change  as  the  probe  is  installed  or  removed.  If 
we  make  a  careful  comparison  between  the  STFT  results 
and  the  CWT  results,  we  note  that  the  initial  response 
(2.2 ns at 0.5 scale = 1.56 GHz) and the second response
(11.5 ns and 0.5 scale = 1.56 GHz also) are very similar
in relative time and frequency (1.5 GHz corresponds to
0.5 scale = 1.56 GHz).


We  subtracted  pairs  of  data  from  the  above  data  sets  and 
formed  the  Wavelet  transform  on  the  difference 
waveforms.  The  plots  are  given  in  Figures  17  and  18, 
and  listed  below. 

Figure 17: 6USA9 - 7USA2; probe on both; internal
open minus internal short

Figure  18:  6USA9  -  7USA3;  (probe;  open)  minus  (no 
probe;  short) 

In  the  subtracted  pairs,  clear  multiple  interaction  effects 
are  seen.  Both  show  clear  effects  at  the  time  delay 
corresponding  to  an  internal  bounce  off  the  inside 
bottom (10 to 11 ns; 0.75 scale). If both data sets have a
probe  (Fig.  17),  no  change  is  seen  at  2  ns  where  the 
probe  scattering  effects  are  expected.  If  only  one  has  a 
probe,  a  clear  dispersive  probe  effect  is  seen  at  this 
location.  As  was  shown  in  the  subtracted  STFT  data, 
there  appears  to  be  an  internal  reflection  mechanism, 
where  the  signal  penetrates  into  the  missile  body  through 
the  probe  and  the  open  ended  coaxial  bulkhead 
connector  on  the  inside. 

We  especially  note  that  the  response  at  a  delay  of  6.5  ns 
after  the  initial  response  is  probably  "real."  It  is 
ambiguous  in  the  STFT  results,  but  seems  clear  and 
specific  in  the  wavelet  images.  Unfortunately,  if  we  look 
at  the  earlier  45  degree  write-up,  we  see  no  specific 
mechanism  that  corresponds  to  this  type  of  time  delay. 
This  response  mechanism  needs  more  study. 

7.3.  Wavelet  Analysis;  Summary 
The  wavelet  analysis  has  shown  very  close  agreement 
with  the  STFT  results  in  this  case.  The  mechanisms 
involved  in  the  mock  missile  do  not  have  a  form  which 
would  be  critical  to  the  differences  in  the  resolution 
abilities  between  the  STFT  and  the  CWT.  If  there  were 
more  dispersive  mechanisms  involved  in  the  scattering 
from  the  missile  shape,  then  the  differences  would 
become  important  because  the  curves  would  have 
specific  dispersive  signature  shapes  [12,  14,  15,  16,  18 
and  19]. 

We  have  shown  the  general  application  of  this  type  of 
analysis  to  impulse  measurements  such  as  can  be 
obtained  from  the  time  domain  range  and  the  d-dot 
probe.  Other  types  of  scattering  mechanisms,  such  as 
propagation  through  dispersive  media  (plasma  or  earth) 
or  scattering  from  resonant  targets  (cavities  or  ducts) 
would  more  critically  depend  on  wavelet  analysis. 


8.  BISPECTRAL  ANALYSIS 
8.1.  Bispectral  Analysis 

So far, we have been discussing transformations from
the time domain to the time-frequency domain. The goal
has been to identify the spectral response of various
scattering mechanisms. The direct scattering
mechanisms are relatively easy to characterize because
there is a direct relationship between the time of
occurrence and the range domain (or down-range)
location of the scatterer. Many of the terms in the
response profile, however, are the result of multiple
interactions. These terms are difficult to specifically
identify because they appear no differently in the
down-range profile than the direct scattering terms. There is a
difference, however. The interactive terms are related in
a specific way to the two direct terms involving the
interacting scatterers. An algorithm that can reveal these
relationships is the bispectrum.

The bispectrum is defined as the two-dimensional
Fourier transform of the third moment sequence or the
bicorrelation function $R(\tau_1,\tau_2)$. For
scattering measurements in the time domain, h(t), the
bicorrelation function is defined as

$$R_h(\tau_1,\tau_2) = \int_{t=-\infty}^{t=\infty} h^*(t)\,h(t+\tau_1)\,h(t+\tau_2)\,dt$$

(remember that h(t) is scalar) and thus the bispectrum of
h(t) is defined as

$$B(\omega_1,\omega_2) = \int_{\tau_1=-\infty}^{\infty}\int_{\tau_2=-\infty}^{\infty} R_h(\tau_1,\tau_2)\,\exp(-j(\omega_1\tau_1+\omega_2\tau_2))\,d\tau_1\,d\tau_2.$$

Note that if the data are corrupted by noise, the
bispectrum (which represents an integration over the
domain of $\tau_1$ and $\tau_2$) tends to the bispectrum of the
noiseless data. In other words, the triple correlation of
the noise approaches zero [5 and 8].

An alternate formulation of the bispectrum is to take the
triple product of the spectrum that results from the
Fourier transform of the impulse response h(t). Thus if

$$S(\omega) = \int_{t=-\infty}^{t=\infty} h(t)\,\exp(-j\omega t)\,dt,$$

then it can be shown that

$$B(\omega_1,\omega_2) = S(\omega_1)\,S(\omega_2)\,S^*(\omega_1+\omega_2).$$
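
The equivalence of the two formulations can be checked numerically. The following is a minimal sketch, not part of the original analysis: it assumes a short sampled sequence with circular (periodic) indexing, so that the discrete Fourier transform takes the place of the continuous integrals. The sample values and all function names are illustrative.

```python
import cmath

def dft(x):
    """Plain O(N^2) discrete Fourier transform of a sequence."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]

def bicorrelation(h):
    """Circular third-moment sequence R[m1][m2] = sum_t h*(t) h(t+m1) h(t+m2)."""
    n = len(h)
    return [[sum(h[t].conjugate() * h[(t + m1) % n] * h[(t + m2) % n]
                 for t in range(n))
             for m2 in range(n)] for m1 in range(n)]

def bispectrum_direct(h):
    """Defining form: two-dimensional DFT of the bicorrelation."""
    n = len(h)
    r = bicorrelation(h)
    return [[sum(r[m1][m2] * cmath.exp(-2j * cmath.pi * (k1 * m1 + k2 * m2) / n)
                 for m1 in range(n) for m2 in range(n))
             for k2 in range(n)] for k1 in range(n)]

def bispectrum_product(h):
    """Alternate form: S(w1) S(w2) S*(w1 + w2), frequencies taken modulo N."""
    n = len(h)
    s = dft(h)
    return [[s[k1] * s[k2] * s[(k1 + k2) % n].conjugate()
             for k2 in range(n)] for k1 in range(n)]

# Illustrative impulse response samples (assumed values, not measured data).
h = [complex(v) for v in (0.0, 1.0, 0.0, -0.5, 0.25, 0.0, 0.0, 0.1)]
direct = bispectrum_direct(h)
product = bispectrum_product(h)
max_err = max(abs(direct[a][b] - product[a][b])
              for a in range(len(h)) for b in range(len(h)))
print("largest difference between the two forms:", max_err)
```

The triple-product form needs only one transform of h(t), which is why it is the practical way to compute the bispectrum.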

This  is  the  generalized  form  of  the  bispectrum.  In  this 
form,  it  permits  relationships  between  spectral  responses 
at  two  different  frequencies  to  be  discovered.  Our 
problem  is  different,  however.  Our  problem  is  to 
discover  relationships  between  two  different  responses  in 
the  time  domain.  We  will  make  this  change  below. 


It has been shown for radar scattering that it is possible
to exchange the time and frequency parameters in the
above argument and arrive at a "bi-time" response or,
since time is related to distance by the speed of light, the
"bi-range profile" of a radar target [8 to 11]. Using a
similar argument (with t and $\omega$ interchanged), we arrive
at a form of the bi-time response as

$$B(t_1,t_2) = h(t_1)\,h(t_2)\,h(t_1+t_2),$$

where h(t) is the (band limited) radar target impulse
response. What we are saying is that two response terms
in the impulse response at $t_1$ and $t_2$ can be shown to be
due to an interaction that produces a third term at $t_1+t_2$
if there is a response at the product of $h(t_1)\,h(t_2)$ and
$h(t_1+t_2)$. This information has been used for radar target
identification [5, 6, 8 and 9].
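
As an illustration of the bi-time product, the sketch below builds the image B(t1, t2) for a synthetic sampled response. It assumes that sample index 0 coincides with the time origin and that the response contains two direct terms and one interaction term at the sum of their delays. The amplitudes and indices are invented for the example.

```python
def bi_time(h):
    """Bi-time image B[i][j] = h[i] * h[j] * h[i + j] for a sampled response."""
    n = len(h)
    image = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i + j < n:  # the interaction term must lie inside the record
                image[i][j] = h[i] * h[j] * h[i + j]
    return image

# Synthetic response (assumed values): two direct terms and their interaction.
h = [0.0] * 64
h[10] = 1.0   # first direct term
h[25] = 0.8   # second direct term
h[35] = 0.3   # interaction term at index 10 + 25

image = bi_time(h)
nonzero = [(i, j) for i in range(64) for j in range(64) if image[i][j] != 0.0]
print(nonzero)
```

Only the pair of direct-term delays whose sum carries a response produces a non-zero pixel, and the image is symmetric about the main diagonal.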

The  above  formula  can  be  shown  to  only  apply  to  a  case 
where  the  multiple  interaction  is  along  the  axis  of 
propagation  of  a  monostatic  radar  beam.  It  also  has  a 
problem  with  the  zero  time  offset  [10].  In  the  next 
section,  we  will  modify  the  algorithm  to  apply 
specifically  to  our  45  degree  missile. 

8.2.  Bispectral  Analysis  for  the  45  Deg.  Case 

As  we  can  see,  the  bispectrum  can  be  used  to  analyze 
radar  direct  scattering  and  interactions  when  the  multiple 
scattering  is  along  the  same  axis  as  the  incident  beam.  In 
our  case,  we  have  a  missile  shape  tilted  at  a  45  degree 
angle  and  the  d-dot  probe  is  in  the  near  field  and  is  at  a 
distinct  bistatic  angle  with  respect  to  the  incident  signal. 
The  bispectral  algorithm  must  be  modified  to  fit  this 
specific  configuration  in  order  to  evaluate  multiple 
interactions.  Consider  the  geometry  of  a  45  degree  tilted 
object  as  shown  in  Figure  19. 

In  this  figure,  (1)  an  incident  signal  arrives  at  points  A 
and  B  and  scatters  bistatically  to  the  D-dot  probe,  (2)  the 
incident  signal  arrives  at  point  A,  scatters  to  point  B  and 
then  back  to  point  A,  and  then  to  the  D-dot  probe.  We 
wish  to  use  the  bispectral  concept  to  evaluate  the 
interaction  between  points  A  and  B.  Using  this 
geometry,  (and  the  known  value  of  the  intercept  point,  x) 
for  every  pair  of  points  in  time  t(A)  and  t(B), 
corresponding  to  the  reception  of  a  signal  from 
generalized  points  A  and  B,  we  can  compute  the  time  of 
the  interaction  term  t(C).  We  then  form  the  triple  product 

B(t(A),t(B)) = h(t(A)) h(t(B)) h(t(C)).

The two dimensional image, B(t(A),t(B)), is then
generated. The magnitude of the response at each point
in the image is proportional to the correlation of the
product h(t(A)) h(t(B)) with h(t(C)). This magnitude will be
non-zero when there is an interaction term between A
and B.

Using (relatively) simple trigonometry, we have derived
the generalized value of t(C) for given values of t(A) and
t(B) and the intercept point x.
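
The modified triple product can be sketched with the interaction time supplied as a function. The trigonometric expression for t(C) is not reproduced in this text, so `toy_interaction_index` below is a purely hypothetical placeholder (the interaction is assumed to arrive after A by twice the A-to-B delay). It stands in for the derived geometry and is not the authors' formula.

```python
from typing import Callable, List, Optional

def interaction_image(
    h: List[float],
    interaction_index: Callable[[int, int], Optional[int]],
) -> List[List[float]]:
    """Image B[a][b] = h[a] * h[b] * h[c], with c supplied by the geometry rule."""
    n = len(h)
    image = [[0.0] * n for _ in range(n)]
    for a in range(n):
        for b in range(n):
            c = interaction_index(a, b)
            if c is not None and 0 <= c < n:
                image[a][b] = h[a] * h[b] * h[c]
    return image

def toy_interaction_index(a: int, b: int) -> Optional[int]:
    """Hypothetical rule: A -> B -> A adds twice the A-to-B delay to t(A)."""
    if b <= a:
        return None
    return a + 2 * (b - a)

# Synthetic response (assumed values): direct terms at 20 and 30, interaction at 40.
h = [0.0] * 64
h[20], h[30], h[40] = 1.0, 0.7, 0.2

image = interaction_image(h, toy_interaction_index)
bright = [(a, b) for a in range(64) for b in range(64) if image[a][b] != 0.0]
print(bright)
```

Replacing the placeholder rule with the geometry-specific expression for t(C) changes only the function passed in. The image construction itself stays the same.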

8.3.  Bispectral  Analysis  Applied  to  the  Tilted  Missile 
Data 

We  applied  the  45  deg,  modified  bispectral  analysis  to 
the  45  degree  missile  impulse  scattering  data.  The  45 
deg.  modified  bispectral  image  for  file  7USA3  (no 
probe;  internal  short)  is  shown  in  Figure  20.  Ignoring 
the  data  on  the  main  diagonal  (t(A)  =  t(B)  =  t(C)),  we 
can  see  several  specific  interaction  terms.  They  are 
listed  below.  All  timing/mechanism  estimates  are  based 
on  geometrical  considerations. 

1. Vertical index 65; horizontal index 110 (remember that each
index is 0.02 ns)

This timing corresponds to an interaction between a
reflection from the top of the frustum and the bottom of
the frustum. (Top of frustum to bottom of frustum to
d-dot).

2. Vertical index 55; horizontal index 130

This timing corresponds to an interaction between a
reflection from the top of the frustum and something
with a delay time 1.4 ns longer. There is no obvious
mechanism except perhaps an attached mechanism that
forms a creeping wave around the back of the missile
cylinder/frustum and returns to the D-dot probe. More
study of this is called for. It may be a false correlation
simply because of an overlap with other terms.

3. Vertical index 65; horizontal index 180

This timing corresponds to an interaction between a
reflection from the top of the frustum and a point 3/4 of
the way down the cylinder part. There does not seem to
be a scatterer at this point, but perhaps we are actually
seeing an interaction between the nose of the missile and
the base of the missile. More detailed study of this
algorithm is called for.

4.  Vertical  index  120;  horizontal  index  180 

This  timing  corresponds  to  an  interaction  between  a 
direct  creeping  wave  at  the  bistatic  45  degree  specular 
point  and  a  term  2.4  ns  later.  No  mechanism  is  seen 
there.  It  may  be  an  accidental  overlap  of  other  terms. 

5.  Vertical  index  180;  horizontal  index  210 

This  timing  corresponds  to  an  interaction  between  the 
near  side  of  the  base  and  the  far  side  of  the  base  of  the 
missile  shape. 
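
For reference, the image indices listed above can be converted to time with the stated increment of 0.02 ns per index. The short sketch below does only that conversion. The times are relative to the origin of the image axes, and the variable names are illustrative.

```python
SAMPLE_INTERVAL_NS = 0.02  # each image index is 0.02 ns, as stated in item 1

# (vertical index, horizontal index) for the five listed interaction terms
terms = {1: (65, 110), 2: (55, 130), 3: (65, 180), 4: (120, 180), 5: (180, 210)}

def index_to_ns(index: int) -> float:
    """Convert an image index to time in nanoseconds."""
    return index * SAMPLE_INTERVAL_NS

for item, (vertical, horizontal) in terms.items():
    print(f"item {item}: vertical {index_to_ns(vertical):.2f} ns, "
          f"horizontal {index_to_ns(horizontal):.2f} ns")
```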

We also applied this algorithm to the data sets from
6USA8 (no probe; internal open; Figure 21), 7USA2
(probe; internal short; Figure 22) and 6USA9 (probe;
internal open; Figure 23).


If  we  compare  these  figures,  we  see  a  change  as  we  go 
from  the  no-probe  case  to  the  probe  case  at  index  55 
vertical  180  horizontal.  This  timing  might  correspond  to 
an  interaction  between  the  probe  and  the  bottom  of  the 
missile  shape  (as  discussed  in  item  3  above).  This  is 
speculation. 

If  we  compare  the  internal  open  with  the  internal  short 
case,  we  also  see  a  possible  effect  at  55  vertical  and  180 
horizontal  (the  same  location  as  described  just  above  for 
the  probe  versus  no-probe  case).  Once  again,  we  can 
speculate  that  there  is  some  effect  involving  the  change 
caused by the open versus short case which affects the
probe versus bottom of missile interaction.

8.4. Bispectral Analysis; Summary

In summary, what we see in the image seems to show
multiple interaction terms between the top and bottom of
the frustum and between the near side and the far side
of the base. We also may see an interaction between the
frustum and the base of the missile shape.

There  also  seem  to  be  some  "accidental"  bright  spots. 
These  could  be  caused  by  the  accidental  correlation  of 
some  strong  terms  in  the  time  domain  response.  Note, 
for  example,  the  off-diagonal  lines  in  the  image.  They 
originate  at  specific  bright  points  and  cause  "bright 
spots"  where  they  cross  the  horizontal  or  vertical 
response  lines  of  other  strong  scattering  mechanisms. 
These  intersection  spots  are  likely  not  true  interaction 
terms. 

Clearly,  this  modified  bispectral  interaction  analysis 
shows  promise.  We  may  actually  be  able  to  specifically 
identify  interaction  terms  in  the  impulse  response.  More 
research  on  this  modified  application  of  the  bispectrum 
is  called  for. 

9.  CONCLUSIONS 

9.1  Time-Frequency  Representations  for  Impulse 
Scattering  Analysis 

We  have  shown  applications  of  the  Short  Time  Fourier 
Transform,  the  Wigner  algorithm,  the  wavelet  transform 
and  the  bispectrum  to  the  problem  of  analysis  of  the 
resonant  behavior  of  a  generic  missile  shape.  The  missile 
was  tilted  45  degrees  toward  the  impulse  radar. 

All three analysis techniques were able to show

1) the direct reflection mechanisms and their frequency
spectrum

2) multiple reflection mechanisms involving

(a) multiple reflections (interactions) from the
nose region to the rear

(b) multiple reflections (interactions) across
the nose region and the rear region

Note that any multiple interaction mechanism is, in fact,
a type of resonance.

A  particularly  effective  technique  was  to  subtract  two 
data  files  involving  only  a  specific  change  in  the  radar 
target  (with  and  without  a  probe,  for  example).  This 
technique  was  able  to  extract  specific  interactions  that 
could  not  be  seen  directly. 
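
The subtraction step can be sketched as follows, assuming that both records share the same time base and are already aligned sample for sample. The waveform values below are synthetic and serve only to show that the common scattering terms cancel.

```python
from typing import List

def difference_waveform(first: List[float], second: List[float]) -> List[float]:
    """Sample-by-sample difference of two time-aligned waveforms."""
    if len(first) != len(second):
        raise ValueError("waveforms must share the same time base")
    return [a - b for a, b in zip(first, second)]

# Synthetic records (assumed values): the second target state adds one term.
common = [0.0, 1.0, 0.5, 0.0, -0.25, 0.0, 0.0, 0.0]
probe_only = [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.125, 0.0]
with_probe = [c + p for c, p in zip(common, probe_only)]

difference = difference_waveform(with_probe, common)
print(difference)
```

The difference waveform can then be passed to any of the transforms discussed above in place of the raw record.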

Each  analysis  technique  had  its  own  area  of  specific 
application. 

(1)  STFT;  This  technique  is  linear,  robust  and  fast.  In 
the  target  under  test  here,  the  reflection  mechanisms 
were  far  enough  apart  to  permit  the  STFT  to  specifically 
identify  most  scattering  terms. 

(2)  The  Wigner  algorithm;  This  technique  has  more 
resolution  in  both  the  time  and  frequency  domain  than 
the STFT, but the non-linear nature of the transform
generates  "mixing"  terms  in  the  image  that  can  be 
confusing.  Fortunately,  the  STFT  was  able  to  help 
distinguish  the  false  terms  from  the  real  terms. 

(3)  The  wavelet  transform;  This  technique  permits 
optimal  resolution  in  the  time  and  frequency  domains 
simultaneously  so  that  both  short  time  events  and  narrow 
frequency  events  can  be  revealed  in  a  single  image.  The 
logarithmic  frequency  index  (the  "scale"  index)  permits 
the  entire  frequency  span  to  be  seen.  Resolution  is 
optimized  to  this  scale  index,  rather  than  to  frequency. 

(4)  The  bispectrum;  The  bispectrum  is  specifically 
appropriate  to  reveal  multiple  interactions.  With  the 
earlier  algorithms,  it  was  necessary  to  identify  multiple 
interactions  by  their  location  in  the  time  domain 
(ambiguous  at  best).  The  bispectrum  permits  direct 
calculation  of  the  correlation  of  the  interaction  terms.  In 
this  specific  45  degree  missile  shape  example,  a 
modified  version  of  the  bispectrum  was  used.  The  result 
reveals  several  interaction  terms.  Interpretation  remains 
difficult  because  of  the  "accidental"  correlations  which 
occur. 

9.2  Evaluation  of  the  Missile  Shape 
We see direct scattering (at 1 GHz) from specular and
diffraction sides and edges of the missile. The frustum
is a particularly strong scatterer when the missile is tilted
toward the radar d-dot probe.

We see interaction terms between the frustum
neighborhood and the rear of the missile, as well as what
appears to be a resonance involving the length of the
body (1 wavelength). Note that a resonance and a
multiple interaction are identical effects in most cases.

We also may be able to see effects due to the internal
resonances of the missile (differences between internal
"short" and internal "open"). These effects occur when
the probe on the nose of the missile conducts energy
directly into the interior of the missile. They are easy to
confuse with external attached mode multiple interaction
with the frustum area.

Figure 2. STFT Algorithm sketch.

Figure 3. Raw data and STFT image for 7USA3 (45°).

Figure 4. Raw data and STFT image for 7USA2 (45°).

Figure 5. Raw data and STFT image for 6USA9 (45°).

Figure 7. Raw data and STFT image for 6USA9 minus 7USA3 (45°).

Figure 8. Raw data and STFT image for 6USA9 minus 7USA2 (45°).

Figure 9. Raw data and STFT image for 7USA2 minus 7USA3 (45°).

Figure 10. Raw data and Wigner algorithm for 7USA3 (45°).

Figure 12. Raw data and Wigner algorithm for 6USA9 (45°).

Figure 13. Raw waveform and STFT image of theoretical data (impulse at 1.1 ns; 2.3 GHz
starting 4.3 ns; 6.7 GHz starting 9.7 ns; 12.4 GHz starting at 12.4 ns).

Figure 20. Bispectral image of 7USA3.

Figure 22. Bispectral image of 7USA2.

1. INTRODUCTION

1.1  The  Land  Mine  Pollution  Problem 

The pollution of areas with large quantities of anti-tank and
anti-personnel land mines, especially in countries of former
armed conflicts, like Afghanistan, Angola, Cambodia, Iraq,
Kuwait, Somalia, Vietnam and Yugoslavia, is a major problem.
According to United Nations estimates, the number
of uncharted buried anti-personnel mines exceeds 100 million,
in over 60 countries around the world. The rate of new
mines being laid is about one million per year, which surpasses
the number of mines cleared by a factor of twenty.
Some 2 million mines have been deposited in the war-torn
areas of former Yugoslavia alone. Whole areas of countries,
especially Cambodia, have been severely held back
from further development [1].

According to the Mine Clearance Planning Agency in
Afghanistan, over a period of 15 years an estimated 20,000
civilians have been killed and 400,000 wounded by land
mines in that country. The current rate is 4000 killed and
another 4000 wounded annually, worldwide. Injuries are
horrific and usually result in amputation; refugees returning
to conflict zones often find minefields in previously farmed
lands, and usually have to clear the land themselves. Methods
used at present for locating and clearing mines are
painstaking, costly, time consuming and highly dangerous.
Those methods include the use of sniffer dogs, magnetic
mine detection aids (e.g. metal detectors) and manual probing.
These methods are very slow and involve teams of two
working their way along rows as narrow as 1 m across. For
instance, a team of 30 men with dogs is able to cover only
2000 square meters per day. The cost of such clearance
is reported to lie between $200 (US dollars) and $1000 per
mine. After the food problem, the so-called land mine pollution
problem is seen as the biggest humanitarian problem
in the world [1].

Initial moves have been made towards a worldwide ban of
land mines. In December 1993, the United Nations General
Assembly passed a non-binding resolution calling for
such a ban. In the 1981 United Nations Convention, some
rules governing the use of land mines (considering for example
the automatic neutralization of land mines and the
obligation to record pre-planned land minefields by means
of maps) have been agreed upon. This international law
regulating the use of land mines, the 1981 Land Mines Protocol,
at present only regulates the use of land mines in
wars, but not in internal conflicts, and has been ratified by
only 39 countries. Meanwhile, new minefields have been
created in Georgia, Armenia and Tajikistan, for example,
as well as in the territories of former Yugoslavia.

In May 1996, a United Nations Review Conference of the
1981 Convention finished in Geneva. One of the main topics
was land mines. Some additional rules to those of 1981
were agreed upon. Internal conflicts are now also covered
by the convention. It was further agreed that land mines
must have a self-deactivation device and that they must contain
at least 8 g of iron, to be detectable by the current types
of mine detectors.

The above discussion on the land mine pollution problem
demonstrates the need to develop new technologies to increase
the efficiency and to reduce the costs of mine clearing
operations.

1.2  Electromagnetic  Inversion 

The detection of objects buried in the ground is in general
difficult. An even bigger problem is discrimination or
classification of the buried object. The first problem is the
ground itself. It is usually very inhomogeneous and has a
complicated layered structure, often containing rocks and
voids. Moreover, many other objects like metal cans can
be present in the ground. Without a reliable identification
method, the false-alarm rate of a ground-penetrating-radar
system would be so high that the cost of clearing a minefield
would be prohibitive.

The objective of this lecture on inverse methods is to
present some methods to carry out detection of buried objects.
Subsequently, the reconstructed data is available to
dedicated pattern recognition algorithms to obtain classification
of the detected object.

In general the electromagnetic data is obtained by using
fixed-frequency systems operating at a single or multiple
frequencies or ultra-wideband systems. The term ultra-wideband
is used in all situations where one deals with
pulses of extremely short duration. A pulse of almost
zero duration, which approximates a delta function, contains
almost all frequencies. Hence, short pulses are ultra-wideband
pulses. It has been claimed by some authors that
UWB systems have many advantages compared to fixed-frequency
systems when used for probing the ground [2],
[3]. In a UWB system, like a ground-penetrating-radar system,
a large number of individual frequencies are applied
towards the object of interest.

1.3 The EM-Inversion Methods Presented in this Lecture

The first method that is presented is denoted as "Microwave
Image Reconstruction Methods" by S. Primak et al. This
method uses a special mapping of gathered data and in this
paper a tutorial type overview of microwave tomographic
imaging is given. The second method that is presented is
denoted as "Two-Dimensional Inverse Profiling: Nonlinear
Optimization and Embedding" by A. Tijhuis et al. In this
paper, a method is presented to solve nonlinear inverse scattering
problems. This approach is based on the availability
of efficient iterative solvers to carry out the electromagnetic
computations. The last method discussed in this lecture is
denoted as "Non-linear Inversion Based on Contrast Source
Gradients" by van den Berg et al. This method is an algorithm
for reconstructing the complex index of refraction of
a bounded object. Also, this method incorporates efficient
iterative solvers.

1. SUMMARY

In this paper we give a tutorial type overview of microwave
tomographic imaging, or inverse synthetic aperture radar (ISAR)
and SAR imaging, of perfectly conducting bodies by considering
the simplest high-frequency model of scattering, and we review
methods allowing one to obtain these images.

2.  IDEAL  POINT  SCATTERER  MODEL  AND  ISAR 

ISAR (SAR) imaging provides a way to map data gathered at
multiple frequencies and aspect angles into a two-dimensional
image, where the points correspond to x-y coordinates in
space (called down-range and cross-range respectively) [3].
An intuitive explanation based on ideal point scatterers will be
given in this section.

Consider a monochromatic electromagnetic plane wave,
having a frequency $\omega$, incident at an angle $\varphi_i$, on a target
containing an ideal scattering centre of strength $A$ at position
$(x_0, y_0)$ (see Fig. 1). The incident plane wave is represented as

$$E^i = \exp(-j\mathbf{k}\cdot\mathbf{r} + j\omega t) \qquad (1)$$

where $\mathbf{k} = k(\hat{a}_x\cos\varphi_i + \hat{a}_y\sin\varphi_i) = \hat{a}_x k_x + \hat{a}_y k_y$ is the wave
vector with $k = \omega/c$ ($c$ is the speed of propagation),
$\mathbf{r} = \hat{a}_x x + \hat{a}_y y$ is the position vector in the plane, and $\hat{a}_x$ and $\hat{a}_y$
are the unit vectors in the x and y directions, respectively. The
resulting backscattered field can be written as [3]

$$E^s = A\exp(-2jk_x x_0 - 2jk_y y_0)\exp(j\mathbf{k}\cdot\mathbf{r} + j\omega t) \qquad (2)$$

Dividing the scattered field by the incident field and
multiplying by $\exp(-2j\mathbf{k}\cdot\mathbf{r})$ one obtains

$$S(\omega,\varphi_i) = A\exp(-2jkx_0\cos\varphi_i - 2jky_0\sin\varphi_i) \qquad (3)$$

If we take the 2-D inverse Fourier transform of (3) with respect
to $2k_x$ and $2k_y$ we obtain

$$s(x,y) = A\,\delta(x-x_0)\,\delta(y-y_0) \qquad (4)$$


Figure 1. Incident wave vector and ideal scattering centre.

This  is  the  essential  idea  behind  ISAR  processing:  ideal 
scattering  centres  correspond  to  ideal  points  in  the  ISAR  image 
[3,4]. 
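
The point-scatterer argument can be illustrated numerically. The sketch below is an illustration under stated assumptions, not the processing used in the paper: it samples the response of equation (3) on a small uniform rectangular grid in the wavenumber plane (including negative wavenumbers, purely for symmetry) and evaluates a discrete inverse transform on a coarse pixel grid. The scatterer position, grid sizes and function names are all invented for the example.

```python
import cmath

def point_response(kx, ky, x0, y0, amplitude=1.0):
    """Normalised backscatter of an ideal point scatterer, as in equation (3)."""
    return amplitude * cmath.exp(-2j * (kx * x0 + ky * y0))

def image_pixel(samples, x, y):
    """Discrete inverse transform with respect to 2*kx and 2*ky at pixel (x, y)."""
    total = sum(s * cmath.exp(2j * (kx * x + ky * y)) for kx, ky, s in samples)
    return total / len(samples)

# Assumed scatterer position and sampling grids (illustrative values only).
x0, y0 = 0.3, -0.2
k_values = [0.5 * n for n in range(-8, 9)]
samples = [(kx, ky, point_response(kx, ky, x0, y0))
           for kx in k_values for ky in k_values]
grid = [round(-0.5 + 0.1 * i, 1) for i in range(11)]

# Brightest pixel as (magnitude, x, y).
peak = max((abs(image_pixel(samples, x, y)), x, y) for x in grid for y in grid)
print("brightest pixel at", (peak[1], peak[2]))
```

With a finite wavenumber span the delta function of equation (4) becomes a main lobe of finite width, but its maximum still falls on the scatterer position.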


3.  METHODS  OF  IMAGE  CONSTRUCTION 

As we've seen, but not proved, the scatterer geometry is simply
the 2-D inverse Fourier Transform of the reflectivity of the
object in the $k_x$-$k_y$ plane. Methods of image construction
from measurements, based on the direct application of the 2-D
inverse Fourier Transform as well as alternative methods,
making use of the so-called Central Slice Theorem, will now be
discussed.

3.1.  2-D  inverse  Fourier  Transform 

As has just been discussed, an image of the geometry of a
scatterer can be constructed by an inverse 2-D Fourier
Transform of the scattered signal in the frequency domain, that
is [5, 6]

$$s(x,y) = \frac{1}{\pi^2}\iint S(k_x,k_y)\exp[2j(k_x x + k_y y)]\,dk_x\,dk_y \qquad (5)$$

This expression gives us a direct method of recovering the
image from the measurements. Unfortunately, the discrete
version of (5) requires the uniform rectangular sampling of
information in the $k_x$-$k_y$ domain, while the measurements
are usually taken in the $\omega$-$\varphi_i$ domain, which is non-uniform
in the $k_x$-$k_y$ domain. Thus some kind of interpolation has to
be used (see [5] for details).

3.2.  Filtering  and  backprojection 

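An alternative method will now be described (see [6, 7, 11]).
Using the fact that $k_x = (\omega/c)\cos\varphi_i$ and $k_y = (\omega/c)\sin\varphi_i$
one can rewrite integral (5) as (with the constant factors of $c$
absorbed into $\omega$)

$$s(x,y) = \frac{1}{\pi^2}\iint S(k_x,k_y)\exp[2j(k_x x + k_y y)]\,dk_x\,dk_y
= \frac{1}{\pi^2}\int_{-\infty}^{\infty}\int_{0}^{\pi} S_{\varphi_i}(\omega)\exp[2j\omega(x\cos\varphi_i + y\sin\varphi_i)]\,|\omega|\,d\varphi_i\,d\omega \qquad (6)$$

where $S_{\varphi_i}(\omega) = S(\omega,\varphi_i)$ is the slice of the frequency domain
image (frequency response of the target) taken at the
angle $\varphi_i$. Using $\rho = \sqrt{x^2 + y^2}$ and $\beta = \mathrm{atan}(y/x)$, we get

$$s(\rho,\beta) = \frac{1}{\pi^2}\int_{0}^{\pi}\int_{-\infty}^{\infty} S_{\varphi_i}(\omega)\,e^{2j\omega\rho(\cos\beta\cos\varphi_i + \sin\beta\sin\varphi_i)}\,|\omega|\,d\omega\,d\varphi_i \qquad (7)$$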

Therefore the shape of the scatterer can be obtained by first
filtering the frequency domain slices, that is obtain the filtered
signal

$$\hat{s}_{\varphi_i}(\rho\cos(\beta-\varphi_i)) = \frac{1}{\pi^2}\int_{-\infty}^{\infty} S_{\varphi_i}(\omega)\,|\omega|\,e^{2j\omega\rho\cos(\beta-\varphi_i)}\,d\omega \qquad (8)$$

This is just the 1-D inverse Fourier transform (using the spatial
parameter $2\rho\cos(\beta-\varphi_i)$) of the frequency domain slice,
taken at angle $\varphi_i$ and multiplied by $|\omega|$. Alternatively, it is the
spatial domain slice convolved with $h(t) = F^{-1}[|\omega|]$ [6].

The next step is called back-projection of the filtered slices:

$$s(\rho,\beta) = \int_{0}^{\pi} \hat{s}_{\varphi_i}(\rho\cos(\beta-\varphi_i))\,d\varphi_i \qquad (9)$$

This states that the reconstructed function $s(\rho,\beta)$ is the result
of averaging the signal $\hat{s}_{\varphi_i}(\rho\cos(\beta-\varphi_i))$ with respect to $\varphi_i$
which, in turn, is the back-projection of the signal $\hat{s}_{\varphi_i}(x)$ along
the line in the same direction in which the projection function
is obtained (see Fig. 2). Thus, the reconstructed pixel is the
averaged back-projection of the measurements, taken all
around the object. Since the filtered version of the
measurements is used, this algorithm is called filtered
back-projection [6]. From equation (9), the discrete approximation
of the reconstructed function $s(\rho,\beta)$ can be obtained as
follows

$$s(\rho,\beta) = \int_{0}^{\pi} \hat{s}_{\varphi_i}(\rho\cos(\beta-\varphi_i))\,d\varphi_i
\approx \frac{\pi}{N}\sum_{n=0}^{N-1} \hat{s}_n(\rho\cos(\beta-\pi n/N)) \qquad (10)$$

which can be performed sequentially as a new measurement is
obtained. Thus, the filtered back-projection algorithm employs
only a series of 1-D inverse Fourier transforms and does not
require the complete data set to start reconstruction. This
makes the algorithm the best choice for reconstruction of the
function from its projections [6, 13].

[Figure 2: projection s(x cos θ + y sin θ)]

3.3. Back-projection and filtering

The filtered back-projection is not the only way to reconstruct
the desired function from the measurements. In fact, equation
(5) can be rewritten as

$$s(x,y) = \frac{1}{\pi^2}\int_{-\infty}^{\infty}\int_{0}^{\pi} S_{\varphi_i}(\omega)\,e^{2j\omega(x\cos\varphi_i + y\sin\varphi_i)}\,|\omega|\,d\varphi_i\,d\omega \qquad (11)$$

or, using the variables $\rho$ and $\beta$, as

$$s(\rho,\beta) = \frac{1}{\pi^2}\int_{-\infty}^{\infty}\int_{0}^{\pi} S_{\varphi_i}(\omega)\,e^{2j\omega\rho\cos(\beta-\varphi_i)}\,|\omega|\,d\omega\,d\varphi_i
= F^{-1}(|\omega|) * \frac{1}{\pi^2}\int_{-\infty}^{\infty}\int_{0}^{\pi} S_{\varphi_i}(\omega)\,e^{2j\omega\rho\cos(\beta-\varphi_i)}\,d\omega\,d\varphi_i \qquad (12)$$

and, finally,

$$s(\rho,\beta) = F^{-1}(|\omega|) * \int_{0}^{\pi} s_{\varphi_i}(\rho\cos(\beta-\varphi_i))\,d\varphi_i \qquad (13)$$

where $*$ denotes 2-D convolution. The final integral term in
(12) represents the back-projection of the signal, restored from
the frequency domain measurements without any filtering, i.e.

$$s_{\varphi_i}(x) = \frac{1}{\pi^2}\int_{-\infty}^{\infty} S_{\varphi_i}(\omega)\,e^{2j\omega x}\,d\omega \qquad (14)$$

but to restore the true function $s(\rho,\beta)$ one has to convolve the
result of the back-projection with a point-spread function [6]

$$F^{-1}(|\omega|), \qquad \rho = \sqrt{x^2 + y^2} \qquad (15)$$

which depends only on the radial distance $\rho$.

As we can see, this algorithm allows the reconstruction of the
image in two steps, one of which requires a two-dimensional
convolution with a singular function. Where only a rough
image of the body is required, the convolution may be omitted.
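
The filtered back-projection of Section 3.2 can be sketched in a few lines. This is a minimal illustration under stated assumptions rather than a reference implementation: the measured slices are simulated for a single ideal point scatterer, the speed of propagation is normalised to 1, the frequency samples are symmetric about zero, overall constant factors are dropped, and all names and grid sizes are invented for the example.

```python
import cmath
import math

def slice_response(omega, phi, x0, y0, amplitude=1.0):
    """Simulated slice S_phi(omega) for an ideal point scatterer (c = 1 assumed)."""
    return amplitude * cmath.exp(
        -2j * omega * (x0 * math.cos(phi) + y0 * math.sin(phi)))

def filtered_slice(slice_data, omegas, u):
    """Ramp-filtered 1-D inverse transform of one slice, evaluated at offset u."""
    return sum(s * abs(w) * cmath.exp(2j * w * u)
               for s, w in zip(slice_data, omegas))

def back_project(data, omegas, angles, x, y):
    """Discrete back-projection of the filtered slices at pixel (x, y)."""
    total = 0j
    for slice_data, phi in zip(data, angles):
        u = x * math.cos(phi) + y * math.sin(phi)  # equals rho*cos(beta - phi)
        total += filtered_slice(slice_data, omegas, u)
    return total * math.pi / len(angles)

# Assumed scatterer position and sampling (illustrative values only).
x0, y0 = 0.3, -0.2
omegas = [0.5 * m for m in range(-8, 9)]
n_angles = 12
angles = [math.pi * n / n_angles for n in range(n_angles)]
data = [[slice_response(w, phi, x0, y0) for w in omegas] for phi in angles]

grid = [round(-0.5 + 0.1 * i, 1) for i in range(11)]
peak = max((abs(back_project(data, omegas, angles, x, y)), x, y)
           for x in grid for y in grid)
print("brightest pixel at", (peak[1], peak[2]))
```

Each slice contributes to the image independently of the others, so the sum over angles can be accumulated as the measurements arrive.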

4.  PHYSICS-BASED  SCATTERING  CENTRE  MODEL 

The  previous  considerations  were  based  on  the  following 
assumptions: 

1)  that  scattering  from  a  complex  object  can  be  thought  of  as 
scattering  from  a  relatively  small  number  of  ideal  point 
scatterers  [3, 4];  and  that 

2)  measurements are available for all incident angles in the
x-y plane or, at least, for a discrete set of angles
covering the range $[0, 2\pi]$ [6, 7].

In practice, either of these assumptions may become invalid. Due
to the limited frequency content of the excitation, especially
taking into account its base-band nature, the real physical
object cannot be represented as a set of ideal point scatterers
[7, 8]. In fact, the reflection coefficients depend on frequency
and angle of incidence. Also, if a wide range of angles is used,
some scattering centres may become invisible for some of the
aspect angles. Cavities, ducts, and other structures present in
real targets produce scattered signals that have a very strong
frequency dependence which cannot be “focused” on the ISAR
(SAR) image.




For  realistic  targets  it  is  almost  impossible  to  obtain  the 
information  about  the  scattered  waves  for  a  wide  range  of 
aspect  angles.  This  is  why  many  authors  have  considered 
microwave  imaging  using  the  back-projection  algorithm  as  a 
narrow-band  (with  respect  to  the  angular  variable)  technique. 

Another physical phenomenon, inherently present, is the
dependence of the scattering parameters on the polarization of
the incident wave.

In this section we will present more realistic models of the
targets, based on the geometrical theory of diffraction (GTD)
[9].

4.1.  GTD  scattering  model 

The GTD scattering model is valid under the assumption that
the wavelength of the incident excitation is small relative to the
target's extent [9]. It represents the scattered field as originating
from a set of M discrete scattering centres located at points
$(x_m, y_m)$, i.e.

E  ^ 

£'(co,  (p,.,  t)  =  — ^  S,„((0,  (p,.)exp[-;(07',„]  (16) 

n,  =  I 

where $\omega$ is the angular frequency, $c_0$ is the propagation
velocity,


The angle dependence of the magnitude of the scattered field is
a much more complicated question. Most man-made targets
containing sharp edges can be characterized by a sinc-like
aspect dependence of the scattering coefficient. In practice,
especially for narrow-angle ISAR (SAR), this
dependence can be accurately approximated by an
exponential, i.e. [10]

$$ S_m(\omega, \varphi_i) = A_m\, (j\omega)^{\alpha_m}\, e^{\beta_m \varphi_i} \qquad (19) $$

Thus, GTD predicts the following model for the scattering
from large targets

$$ P(\omega, \varphi_i) = \sum_{m=1}^{M} A_m\, (j\omega)^{\alpha_m}\, e^{\beta_m \varphi_i}\, e^{-2j\frac{\omega}{c_0}(x_m\cos\varphi_i + y_m\sin\varphi_i)} \qquad (20) $$

which sometimes results in an ISAR (SAR) image of a target
different from that obtained using the PO approximation.
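
As an illustration of the GTD model of equation (20), the following sketch sums the contributions of a list of scattering centres at one frequency and one aspect angle. The function name, the tuple layout, and the two example centres are hypothetical; only the functional form (a power of jω, an exponential aspect dependence, and a round-trip phase) is taken from the text.

```python
import cmath
import math

C0 = 299_792_458.0  # propagation velocity in free space, m/s

def gtd_response(omega, phi, centres, c0=C0):
    """Normalized backscattered field as a sum over scattering centres.

    Each centre is a tuple (A, alpha, beta, x, y): amplitude, frequency
    exponent (a multiple of 0.5), aspect constant, and position in metres.
    """
    total = 0j
    for A, alpha, beta, x, y in centres:
        amplitude = A * (1j * omega) ** alpha * math.exp(beta * phi)
        delay = cmath.exp(-2j * (omega / c0) * (x * math.cos(phi) + y * math.sin(phi)))
        total += amplitude * delay
    return total

# Hypothetical target: an ideal point scatterer (alpha = 0) and a corner (alpha = -1).
centres = [(1.0, 0.0, 0.0, 0.5, 0.0), (2.0e9, -1.0, 0.0, -0.5, 0.2)]
omega = 2 * math.pi * 1.0e9
response = gtd_response(omega, 0.0, centres)
```

Evaluating `gtd_response` over a grid of frequencies and angles gives synthetic data of the kind an ISAR reconstruction would take as input.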

4.2.  ISAR  image  of  the  point  scatterer  from  narrow  band 
of  angles 

If the measurements are performed in a narrow range of
incident angles, say $-\alpha < \varphi < \alpha$, and in the frequency
range $\omega_{\min} < \omega < \omega_{\max}$, then

I*  |<^  I*  |<^sina  (21) 

1  Co  I  >1  Co 


$$ T_m = \frac{2}{c_0}\,(x_m\cos\varphi_i + y_m\sin\varphi_i) \qquad (17) $$

is the round trip delay of the m-th scattering centre, and M is
the number of scattering centres. The amplitude $S_m(\omega, \varphi_i)$ is a
frequency and angle dependent coefficient determined by the
geometry, composition, and orientation of the scattering
mechanism. The normalized field for a given polarization
(suppressing the dependence) is then

$$ P(\omega, \varphi_i) = \sum_{m=1}^{M} S_m(\omega, \varphi_i)\, e^{-2jk(x_m\cos\varphi_i + y_m\sin\varphi_i)} \qquad (18) $$

with $k = \omega / c_0$.


For frequency dependence, GTD predicts that the scattering
amplitude is of the form $(j\omega)^{\alpha_m}$, where $\alpha_m$ is an integer
multiple of 0.5. Table 1 shows the relation between different
values of the parameter $\alpha$ and the geometry of the scatterer
[10].


Table 1. Relation between $\alpha$ and scatterer geometry

Value of $\alpha$    Canonical scatterers
1             Flat plate at broadside; dihedral
1/2           Singly curved surface reflection
0             Point scatterer; doubly curved surface reflection;
              straight edge specular
-1/2          Curved edge diffraction
-1            Corner diffraction

and  the  image  of  the  scatterer  is  given  by 

^\sina 
^•*0  ^0 


six,y)  =  \j  J 


"nin.r  ^nia.r 


■  r“' 

sin  — 

1  L  c, 


— (A;-jro)l  sinr-^(>-yo)sinal 
0  J  L  J 


(22) 


co„ 


-(y->’o)s'n« 


'-0  •'0 
which  approximately  represents  an  ellipse  in  the  ISAR  plane. 


4.3.  Resonant  scattering 

As was mentioned before, the low frequencies in the incident
waveform can excite resonances of the target [14]. This
effect can be described by an extra term in the frequency
response of the target. If we assume that the resonant response
is localized at the point $(x_0, y_0)$, that the frequency of
oscillation is $\omega_0$ with a damping factor $a$, and that the angle
dependence is proportional to a decaying exponential
term $\exp(\beta\varphi_i)$, then the extra term due to the resonance can
be written as


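$$ S_r(\omega, \varphi_i) = \exp(\beta\varphi_i)\, \frac{\exp\!\left(-2j\frac{\omega}{c_0}x_0\cos\varphi_i - 2j\frac{\omega}{c_0}y_0\sin\varphi_i\right)}{a + j(\omega - \omega_0)} \qquad (23) $$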


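If we assume also that data are gathered over a narrow range of
angles $-\alpha < \varphi_i < \alpha$, equation (23) can be rewritten as

$$ S_r(\omega, \varphi_i) \approx \frac{\exp\!\left(-2j\frac{\omega}{c_0}x_0 - 2j\frac{\omega}{c_0}y_0\varphi_i\right)}{a + j(\omega - \omega_0)} \qquad (24) $$

thus, the image in the ISAR plane will be given by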


a  expr-27“xo-27®<p,.) 

I  1 - gT7(I^-(Oo) - 


a  exp|27“[(x-Xo)  +  (y-yo)(p,.]l 

r  jN  ^  °  ^ 

J  J  a  +  7((o-o)o) 


sing(y->-o)sina] 


-(>'->'o)sina 


u(x-x^) 


where C is a constant and $u(\cdot)$ stands for the unit step
function. The corresponding ISAR image is a “cloud”
spreading through the down-range direction, as shown in Fig. 4.
This cloud not only makes the image more crowded, but may in some
cases obscure other important scattering centres.


As  an  example,  the  finite  difference  time  domain  (FDTD) 
method  was  used  to  obtain  synthetic  time-domain  data  for  the 
monostatic  scattering  from  the  perfectly  conducting  cavity 
shown  in  Fig.  5  [15].  Data  was  taken  for  plane  waves  incident 
with  the  electric  field  polarized  in  the  z-direction.  The  angle  of 
incidence  was  varied,  in  the  x-y  plane,  by  5  degrees  over  a  180 
degree  sweep  of  the  target.  The  ISAR  image,  constructed  using 
the  FDTD  data,  is  shown  in  Fig.  6.  Note  that  although  the 
outline  of  the  cavity  can  be  discerned,  the  artifacts  due  to  the 
resonant  nature  of  the  target  tend  to  obscure  the  image. 


5.  PHYSICAL OPTICS APPROXIMATION OF SCATTERING

Instead  of  approaching  the  problem  from  the  geometrical 
optics  point  of  view  one  could  formulate  the  high-frequency 
scattering  problem  using  physical  optics. 


5.1.  High frequency scattering by a PEC body
Consider a perfectly conducting body illuminated by a plane
monochromatic wave, as shown in Fig. 7. The surface of the
scatterer is denoted by S and its volume by V. Using the
Stratton-Chu formula [16] for the direct integration of the
vector electromagnetic wave equation, the magnetic field
scattered by the object can be written in terms of the magnetic
and electric fields on the surface of the body. Since the
boundary conditions are $n \times E = 0$ and $n \cdot H = 0$, the
integral equation simplifies to


$$ H^s = \int_S \big[(n \times H) \times \nabla G\big]\, ds = -\int_S \nabla G \times (n \times H)\, ds \qquad (26) $$

which, in turn, can be rewritten for the far-field zone as

$$ H^s = -\int_S \nabla G \times (n \times H)\, ds = -j\int_S G\, k_s \times (n \times H)\, ds \qquad (27) $$

Here n is the unit vector normal to the surface of the scatterer,
H is the total magnetic field, G stands for the appropriate
Green's function, and $k_i$ and $k_s$ are the incident and scattered
field wave vectors.

Figure 4. ISAR image of a resonant scatterer (horizontal axis: down-range, units).

Figure 5. Open ended cavity as a scatterer.

Figure 6. ISAR image of rectangular cavity.


Figure  7.  Geometry  of  the  problem 

For scatterers that are large compared to the wavelength of the
excitation, the physical optics approximation states that the
total magnetic field H tangential to the surface is twice the
incident tangential magnetic field on the illuminated side of the
scatterer and zero on the shadow side, i.e.,

$$ n \times H = \begin{cases} 2\, n \times H_i, & k_i \cdot n < 0 \\ 0, & k_i \cdot n > 0 \end{cases} \qquad (28) $$
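
The physical optics rule above amounts to a simple test at each surface point. The sketch below, with hypothetical helper names and vectors stored as plain 3-tuples, returns the tangential field term on the illuminated side and zero on the shadow side. Treating grazing incidence (zero dot product) as shadow is an assumption of this sketch, not a statement from the text.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    """Dot product of two 3-vectors given as tuples."""
    return sum(p * q for p, q in zip(a, b))

def po_surface_term(n, h_inc, k_inc):
    """Physical-optics value of n x H at one surface point."""
    if dot(k_inc, n) < 0:                       # illuminated side
        return tuple(2.0 * c for c in cross(n, h_inc))
    return (0.0, 0.0, 0.0)                      # shadow side (and grazing incidence)

# A facet facing +z, lit from above by a wave travelling in the -z direction.
lit = po_surface_term((0.0, 0.0, 1.0), (1.0, 0.0, 0.0), (0.0, 0.0, -1.0))
shadowed = po_surface_term((0.0, 0.0, 1.0), (1.0, 0.0, 0.0), (0.0, 0.0, 1.0))
```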


H.  = 


Recalling,  that  the  three  dimensional  Green’s  function  for  free 
space  is 


G  =  -j—.  r  = 
47cr 


one  can  rewrite  (31)  in  the  form 


(35) 


In  the  far  field,  we  have 


2j^»2jSk  =  ±  and  (r-x')  ^  . 

4nr  4nr  ^ 

thus,  (35)  can  be  rewritten,  for  the  monostatic  case,  as 


This  approximation  allows  us  to  rewrite  (27)  in  the  following 
form 


where  ^3  =  Here  p(^3)  represents  the 

normalized  monostatic  response. 


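$$ H^s = -2j \int_{k_i \cdot n < 0} G\, k_s \times (n \times H_i)\, ds = 2j \int_{k_i \cdot n < 0} G\, \big[ H_i\,(k_s \cdot n) - n\,(k_s \cdot H_i) \big]\, ds \qquad (29) $$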


Let $S_{k_3}$ be the surface illuminated by the plane wave
propagating in the direction given by the wave vector $k_3$.
Then, assuming a convex object:


If the incident field is taken as a plane wave, then for the
monostatic case $k_s = -k_i$ and thus $k_s \cdot H_i = 0$, which
reduces (29) to

$$ H^s = 2j \int_{k_i \cdot n < 0} G\, H_i\, (k_s \cdot n)\, ds \qquad (30) $$


where $S_{-k_3}$ is the part of the surface S illuminated by a
wave propagating in the direction opposite to $k_3$, i.e. in the
direction described by the wave vector $-k_3$. For this case (37)
can be rewritten as


From this equation it can be seen that the polarization of the
scattered magnetic field yielded by the physical optics
approximation is collinear with the incident magnetic field
polarization. Thus, no cross-polarization information can be
obtained from the physical optics approximation. Equation
(30) can now be rewritten without loss of generality in terms of
the complex spatial vector magnitude of the incident and
scattered fields as


P*(-*3)  =  4  1  e-^^'‘y%-ds 


In  the  far  field,  the  scattered  field  can  be  considered  a  spherical 
wave,  thus 


while  the  incident  field  is  the  plane  wave 


Adding  (37)  and  (40)  yields 


p(*3)  +  p*(-*3)  =  ^  J  ■  ^*3  •  ds  + 

■j=  f  ■'*3  •  ds  =  •'t  .  ds 

According  to  Gauss  theorem,  the  surface  integral  can  be 
transformed  to  a  volume  integral  of  the  form 


pCfcj)  +  p’‘(-*3)  = 

Jk^ 

9^2 
nr  J 


5.2.  The  inverse  problem 

Equation (42) allows us to solve the direct problem, i.e. to find
the scattered far field if the shape of the target is known. A
small modification of (42), suggested in [17], allows one to solve
the inverse problem, i.e. to restore the shape of the object if the
scattered field is known. This will now be explained.

Let us introduce the following membership function

$$ \gamma(x) = \begin{cases} 1, & \forall x \in V \\ 0, & \forall x \notin V \end{cases} $$


Then  the  integral  (42)  can  be  rewritten  as 


3)  +  pV*3)  = 

a/TC  y 


rr  J 


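$$ A(k_3) = \begin{cases} 1, & \forall k_3 \text{ for which } \rho(k_3) \text{ is known} \\ 0, & \forall k_3 \text{ for which } \rho(k_3) \text{ is unknown} \end{cases} \qquad (47) $$

and let $a(x)$ be the three-dimensional Fourier transform of
$A(k_3)$: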


=  ■~^\A{k^)dk^ 


Then,  equation  (46)  can  be  rewritten  as 

=  !±!,.  r  .*  ,4„ 

‘  i  ‘J 

where the star denotes the 3-D convolution operator. Equation (49)
shows that in the case of incomplete angular information the
reconstructed ISAR (SAR) image differs from that
reconstructed from the complete information. It is a 3-D
convolution of the “complete” image with an aperture
function which depends only on the part of the frequency space
where measurements are available.
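
The statement that a restricted aperture turns the image into a convolution can be checked numerically. The sketch below is a 2-D, discrete, periodic stand-in for the 3-D continuous case in the text; the grid size, the rectangular test object, and the choice of aperture are assumptions. It compares an image formed from masked spectral data with the circular convolution of the complete image and the inverse transform of the mask.

```python
import numpy as np

def circular_convolve2d(a, g):
    """Direct 2-D circular convolution: out[n] = sum over m of a[m] * g[n - m]."""
    out = np.zeros(g.shape, dtype=complex)
    for i in range(a.shape[0]):
        for j in range(a.shape[1]):
            out += a[i, j] * np.roll(g, shift=(i, j), axis=(0, 1))
    return out

n = 16
gamma = np.zeros((n, n))
gamma[5:11, 6:10] = 1.0                      # "complete" image: a small rectangle

aperture = np.zeros((n, n))                  # 1 where data exist, 0 elsewhere
aperture[:, :3] = 1.0
aperture[:, -2:] = 1.0                       # keep only the lowest frequencies along axis 1

partial_image = np.fft.ifft2(aperture * np.fft.fft2(gamma))
a = np.fft.ifft2(aperture)                   # aperture function in the image domain
convolved = circular_convolve2d(a, gamma)
```

The two arrays `partial_image` and `convolved` agree to rounding error, while both differ from `gamma`, which is the discrete counterpart of the blurring described above.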

5.4.  Two-dimensional  thickness  function 

If the scattering information is restricted to the two-
dimensional $(k_x, k_y)$ plane, then the scattering information
aperture must be chosen as the one-dimensional Dirac delta
function in the direction orthogonal to this plane, i.e. [18]

$$ A(k_3) = \delta(k_z) $$


Thus,  according  to  (48) 


r-[p(^3)  +  P  W]  r  7ik  r 

. -  ^ ^  i 

3  /?3 

The right part of equation (45) is the three-dimensional Fourier
transform of the membership function $\gamma(x)$, and can be easily
inverted to produce


“  J.  ‘i  ■’ 


,-5/2r  r 


16  J  3 


Thus, if the response from the target is known for all possible
wave vectors $k_3$, then the membership function $\gamma(x)$ can be
exactly reconstructed.

5.3.  Case of incomplete scattering information

Let a scattering information aperture function $A(k_3)$ be
defined as unity or zero for all values of $k_3$ for which the
scattered response is known or unknown, respectively, i.e.


a(x)  =  J  •  ’‘5(k^)dk^  =  ^5(a:)5(j-)  (51) 


and  the  integral  equation  (49)  can  be  rewritten  as 


$$ T(x, y) = \int \gamma(x, y, z)\, dz $$


=  E^Jj^dkJe-^^'‘r^dz 
^3  kj 


and,  finally,  taking  into  account  that  k2  =  k^  in  this  case 

nx,  y)  =  J  (53) 

It can be easily seen from equation (52) that the two-
dimensional function $T(x, y)$, which can be restored from two-
dimensional measurements, represents the dimension of the
body in the z direction, i.e. its thickness [18, 19].
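
A discrete analogue of this result is easy to verify. The sketch below builds a hypothetical voxel model of a body, takes only the k_z = 0 plane of its 3-D spectrum, and inverts that plane in two dimensions. The grid size, the voxel height `dz`, and the shape of the body are assumptions chosen for illustration.

```python
import numpy as np

nx, ny, nz = 16, 16, 16
dz = 0.1                                      # assumed voxel height

gamma = np.zeros((nx, ny, nz))                # membership function of the body
gamma[4:12, 5:9, 2:7] = 1.0                   # a slab 5 voxels thick
gamma[6:8, 5:9, 7:11] = 1.0                   # with a ridge, 9 voxels thick in total

thickness = gamma.sum(axis=2) * dz            # T(x, y): gamma integrated over z

spectrum_3d = np.fft.fftn(gamma)
slice_kz0 = spectrum_3d[:, :, 0] * dz         # data restricted to the k_z = 0 plane
recovered = np.real(np.fft.ifft2(slice_kz0))  # 2-D inversion of that plane
```

Here `recovered` reproduces `thickness`: measurements confined to a plane carry no information on where the material sits along z, only on how much of it there is.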


(60) 


5.5.  Conducting  plate  example 

Consider a perfectly conducting rectangular plate of dimension
$2a \times 2b \times 2c$ centred on the origin. According to (42) one can
write


In  the  very  same  manner  one  can  obtain  that 


=  S  j  J  J  (54) 

/JK 

-a  -h  -c 

sin(2k^a)  smi2kyb)  sin(2k^c) 

Jn  K 

If the measurements are obtained only in the x-y plane, then
the z component of the wave vector is equal to zero, $k_z = 0$,

and (54) can be rewritten as


p{k^)  = 


2\k^^sm{2k/i)sm{2k^b) 


2c 


4c,,  9  ,^,sin(2/c^a)sin(2/c/) 


(55) 


The next step is to use the following relation between the
components of the wave vector and polar coordinates on the
x-y plane

$$ k_x = \frac{\omega}{c_0}\cos\varphi, \qquad k_y = \frac{\omega}{c_0}\sin\varphi \qquad (56) $$

Substituting (56) into (55), one finds that


p(©,  tp)  = 


4c  ©^ 


sinf  2— flcoscp  1  sinf  2— 6sin(p 
V  Cq  j  \  Cq 


4^4  CO 


COS(p 


©  . 

—  sintp 

^0 


4c 


^cos(psin(p 


—  s\n{T^<o)sin(Ti,(o) 


(56) 


(57) 


-I6c  1 


^ _ [^KT„  +  n)iO  ^  ^-j(T,  +  7’„)co 


^  COS  (p  sin  (p 

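where we have used the following notation

$$ T_a = \frac{2a}{c_0}\cos\varphi, \qquad T_b = \frac{2b}{c_0}\sin\varphi \qquad (58) $$

If $\varphi = 0$ or $\varphi = \pi$ then the denominator in (57) becomes
zero and the true value of $\rho(\varphi, \omega)$ can be calculated as a
limit:

$$ \rho(\pi n, \omega) = \lim_{\varphi \to \pi n} \rho(\varphi, \omega) $$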
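
In a numerical evaluation the limit can be taken analytically before coding. The sketch below evaluates only the angular factor sin(T_a ω) sin(T_b ω) / (cos φ sin φ), with T_a = 2a cos φ / c_0 and T_b = 2b sin φ / c_0, and leaves out the constant prefactor of (57). The function names and the threshold `1e-12` are assumptions of this sketch.

```python
import math

C0 = 299_792_458.0  # propagation velocity, m/s

def ratio(x, scale):
    """sin(scale * x) / x, continued by its limit `scale` at x = 0."""
    if abs(x) < 1e-12:
        return scale
    return math.sin(scale * x) / x

def plate_angular_factor(omega, phi, a, b, c0=C0):
    """sin(T_a w) sin(T_b w) / (cos(phi) sin(phi)), finite for every phi."""
    return (ratio(math.cos(phi), 2.0 * a * omega / c0)
            * ratio(math.sin(phi), 2.0 * b * omega / c0))

omega = 2 * math.pi * 1.0e9
broadside = plate_angular_factor(omega, 0.0, a=0.5, b=0.3)
```

Writing each quotient as a sinc-like ratio removes the 0/0 at φ = 0 and φ = π (and also at φ = π/2, where cos φ vanishes) without special-casing the angles themselves.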


and  thus,  the  corresponding  time  response  is 


3i(±%  t)  =  ^|-r5(i  -  — )  -  5(f  +  — )]  (61) 

2  4%dtl  cq  Co  j 

For  any  other  angles,  the  inverse  Fourier  transform,  applied  to 
the  equation  (57),  gives 

f»/  N  16c  1 

9t(t,  (p)  =  — - ^ - 

^coscpsmtp 

[6(j-r,-r2)  +  6(<  +  r,  +  r2) 

-(5(f-7, +r2)-S(f +  7,-72))  ] 

Corresponding ISAR images are shown in Figs. 8 and 9. It can be
clearly seen from Fig. 8 that scattering from the edges of the
plate is dominant, thus producing an image similar to the “optical”
image of the plate, but contaminated by artifacts. At the same
time, the narrowband image shown in Fig. 9 emphasizes point
scattering, i.e. only sharp changes in target shape can be
observed.


Figure  8.  ISAR  image  of  a  rectangular  plate  (PO  data). 


Yet another important feature can be seen from Fig. 9. Instead
of a single point, representing a nearly isotropic point scatterer,
two or three different scatterers can be observed. This
emphasizes the difference between the PO and GTD
approximations. However, for the case of a rectangular plate it
does not introduce a big difference. The results can be quite
different if the target has a few closely spaced sharp edges.


6.  CONCLUSIONS  AND  MODERN  TRENDS 

In  this  paper  we  have  briefly  reviewed  the  background  of  ISAR 
(SAR)  microwave  imaging  and  have  considered  the  main 
methods  of  image  reconstruction  from  the  projections.  As  we 
have  seen,  the  ISAR(SAR)  image  does  not  coincide  with  the 
“true”  optical  image  of  the  target.  Nevertheless,  it  seems  that 
such  an  image  reflects  deep  physical  properties  of  the  target 
under investigation and thus can be used in various
applications.

The  different  techniques  considered  in  this  paper  are  applied  to 
the  simplest  case  of  PEC  targets  in  free  space.  More  general 
setups  have  been  extensively  studied  during  the  last  decade. 
One branch of this study is concerned with transparent and
semi-transparent dielectric bodies. It was found that
backprojection type algorithms can be used to restore the
dielectric profile of a target under test [8] if the dielectric
parameters of the body are just slightly different from those of
the embedding material. If the difference in the electrical
parameters  is  relatively  large,  direct  inversion  becomes 
analytically  difficult  and  a  number  of  iterative  techniques  have 
been  developed.  Most  of  them  require  numerical  solution  of 
the  direct  and  inverse  problem  and  are  time  consuming  [20]. 
This,  in  turn,  has  stimulated  development  of  multiresolution 
numerical  techniques  for  direct  problems  [21].  Another 
characteristic  point  of  classical  microwave  reconstruction 
techniques  is  the  assumption  that  all  the  measurements  are 
performed in the far field. However, a large class of
problems (such as land mine detection) requires near-range
data acquisition. A few publications on this topic can be found
in the open literature [22]. Due to the redundancy of inverse
problems for detection purposes, simplified statistical models
have also been developed [23].


Figure  9.  Narrowband  ISAR  image  of  a  plate  (FDTD  data) 
1.  SUMMARY

Previous work by the first two authors has shown that iterative
procedures for solving nonlinear inverse-scattering problems
that are based on a local linearization can be used to
reconstruct objects of large contrast with a given resolution,
provided that multiple-frequency information is available.
This approach has become feasible since combining the CGFFT
method with a special extrapolation procedure leads to an
extremely fast solution of the corresponding direct-scattering
problem for varying angle of incidence and contrast. Presently,
we are trying to overcome two of the limitations of this
approach. First, we replace the “update step” by a line search
in a nonlinear optimization procedure. Second, we use the
scattering response of the estimated object in a homogeneous
environment to reconstruct objects in more realistic
measurement set-ups like a water-filled metal cylinder.

2.  FORMULATION  OF  THE  PROBLEM 

We consider an inhomogeneous, lossless dielectric cylinder in
an observation domain $\mathcal{D}_0$, embedded in vacuum and excited
by a time-harmonic electric line source on a circular contour
$\partial\mathcal{D}_0$ around the cylinder (Fig. 1). “Complete” scattering
data are available, i.e. the scattered electric field is known on
$\partial\mathcal{D}_0$ for a line source anywhere on that contour. The
question is to what extent we can reconstruct the relative
permittivity $\varepsilon_r(\boldsymbol{\rho})$ from this field information. A
Fourier transformation with respect to the angular coordinate
shows that: (i) any incident field in $\mathcal{D}_0$ can be generated by
a surface current on $\partial\mathcal{D}_0$; (ii) any scattered field in
$\mathcal{D}_0$ is completely determined by its value on $\partial\mathcal{D}_0$.
This implies that excitation or detection in $\mathcal{D}_0$ does not give
new information, which explains the designation “complete”.

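Mathematically, the field caused by each line source can
be identified as a Green's function, i.e., the solution of the
second-order differential equation

$$ \big[\nabla_T^2 - s^2 \varepsilon_r(\boldsymbol{\rho})\big]\, G(\boldsymbol{\rho}, \boldsymbol{\rho}_S) = -\delta(\boldsymbol{\rho} - \boldsymbol{\rho}_S) \qquad (1) $$

that satisfies the radiation condition as $\rho = |\boldsymbol{\rho}| \to \infty$. In
(1), s is a complex frequency with $\mathrm{Re}(s) > 0$.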

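For the function $G(\boldsymbol{\rho}, \boldsymbol{\rho}_S)$ the following contrast-source
integral relation can be derived:

$$ G(\boldsymbol{\rho}, \boldsymbol{\rho}_S) = G^{ref}(\boldsymbol{\rho}, \boldsymbol{\rho}_S) - s^2 \iint_{\mathcal{D}_0} G^{ref}(\boldsymbol{\rho}, \boldsymbol{\rho}')\, \big[\varepsilon_r(\boldsymbol{\rho}') - \varepsilon_r^{ref}(\boldsymbol{\rho}')\big]\, G(\boldsymbol{\rho}', \boldsymbol{\rho}_S)\, dA', \qquad (2) $$

which holds for any $\{\boldsymbol{\rho}, \boldsymbol{\rho}_S\} \in \mathcal{D}_\infty$. In (2), the superscript
“ref” pertains to a known reference configuration.

FIGURE 1: typical inverse-scattering geometry.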

2.  FORWARD  PROBLEM 

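Before we address the inverse problem, we must be able
to determine the Green's function $G(\boldsymbol{\rho}, \boldsymbol{\rho}_S)$ for a known
permittivity $\varepsilon_r(\boldsymbol{\rho})$. This field is obtained from the so-called
field equation, which follows from (2) by taking $\boldsymbol{\rho} \in \mathcal{D}_0$
and $\varepsilon_r^{ref} = 1$. We obtain the contrast-source integral
equation:

$$ G(\boldsymbol{\rho}, \boldsymbol{\rho}_S) = G_0(\boldsymbol{\rho}, \boldsymbol{\rho}_S) - s^2 \iint_{\mathcal{D}_0} G_0(\boldsymbol{\rho}, \boldsymbol{\rho}')\, \big[\varepsilon_r(\boldsymbol{\rho}') - 1\big]\, G(\boldsymbol{\rho}', \boldsymbol{\rho}_S)\, dA'. \qquad (3) $$

In (3), $G_0(\boldsymbol{\rho}, \boldsymbol{\rho}')$ is the two-dimensional Green's function
of free space

$$ G_0(\boldsymbol{\rho}, \boldsymbol{\rho}') = \frac{1}{2\pi} K_0\big(s\,|\boldsymbol{\rho} - \boldsymbol{\rho}'|\big), \qquad (4) $$

where $K_0$ is the modified Bessel function of the second
kind of order 0.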

Equation (3) is discretized as described in [2]. First, we
decompose $G_0(\boldsymbol{\rho}, \boldsymbol{\rho}')$ into a stationary part, which is
logarithmically singular as $R = |\boldsymbol{\rho} - \boldsymbol{\rho}'| \to 0$, and a part which,
apart from an additive constant, behaves as $R^2 \ln(R)$. This
results in two integrals, which can both be approximated
with the aid of piecewise-bilinear interpolation with mesh
size h. This results in a convolution-type form that is
accurate up to $O[h^2 \ln(h)]$ as $h \to 0$. We choose h so small that
the error in determining $G(\boldsymbol{\rho}, \boldsymbol{\rho}_S)$ does not affect the
inversion procedure. The resulting discretized system is solved
by CGFFT, using the extrapolation procedure outlined in
[1] to generate the successive initial estimates. This
procedure is known as “marching on in anything”; in the present
application, we march either in angle of incidence or in
contrast. Typically, the full scattering matrix is available
at the cost of at most a few field computations with the
original CGFFT procedure.
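
The combination of a conjugate-gradient iteration with FFT-based convolutions and a marching scheme can be sketched on a toy problem. The code below is a one-dimensional, periodic illustration and not the procedure of [1] or [2]: the kernel, the contrast profile, and the function names are hypothetical, and the "extrapolation" is reduced to reusing the previous field as the initial estimate for the next contrast value.

```python
import numpy as np

def make_operator(kernel, contrast):
    """A u = u + kernel (*) (contrast * u), with (*) a circular convolution done by FFT."""
    k_hat = np.fft.fft(kernel)
    def A(u):
        return u + np.fft.ifft(k_hat * np.fft.fft(contrast * u))
    def AH(v):  # adjoint of A, needed by the normal-equation form of CG
        return v + np.conj(contrast) * np.fft.ifft(np.conj(k_hat) * np.fft.fft(v))
    return A, AH

def cg_solve(A, AH, b, u0, tol=1e-10, max_iter=500):
    """Conjugate gradients on A^H A u = A^H b; returns (solution, iterations used)."""
    u = u0.astype(complex)
    r = b - A(u)
    z = AH(r)
    p = z.copy()
    zz = np.vdot(z, z).real
    for it in range(max_iter):
        if np.linalg.norm(r) <= tol * np.linalg.norm(b):
            return u, it
        w = A(p)
        alpha = zz / np.vdot(w, w).real
        u = u + alpha * p
        r = r - alpha * w
        z = AH(r)
        zz_new = np.vdot(z, z).real
        p = z + (zz_new / zz) * p
        zz = zz_new
    return u, max_iter

def march_in_contrast(kernel, profile, b, strengths):
    """Solve for increasing contrast, starting each solve from the previous field."""
    u = np.zeros_like(b, dtype=complex)
    history = []
    for strength in strengths:
        A, AH = make_operator(kernel, strength * profile)
        u, iterations = cg_solve(A, AH, b, u)
        history.append(iterations)
    return u, history
```

Each application of `A` or `AH` costs two FFTs, so the cost per iteration grows as N log N rather than N² with the number of unknowns, which is what makes repeated forward solutions affordable inside an inversion loop.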

3.  COST  FUNCTION  AND  LINEARIZATION 
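In numerical schemes for solving the inverse-scattering
problem, it is convenient to introduce a parameterization
of the unknown permittivity:

$$ \varepsilon_r(\boldsymbol{\rho}) = \sum_{\alpha} \varepsilon_\alpha\, \phi_\alpha(\boldsymbol{\rho}) \qquad (5) $$

where $\phi_\alpha(\boldsymbol{\rho})$ is a known expansion function. Within the
space spanned by these expansion functions, we define the
“reconstruction” as the linear combination for which the
squared error

$$ \mathrm{ERR} = \int_0^{2\pi} d\varphi_R \int_0^{2\pi} d\varphi_S\, \big| \tilde{G}(\boldsymbol{\rho}_R, \boldsymbol{\rho}_S) - G(\boldsymbol{\rho}_R, \boldsymbol{\rho}_S) \big|^2 \qquad (6) $$

is minimized. In (6), $G(\boldsymbol{\rho}_R, \boldsymbol{\rho}_S)$ is the known field at
receiver position $\boldsymbol{\rho} = \boldsymbol{\rho}_R$ for a source at $\boldsymbol{\rho} = \boldsymbol{\rho}_S$, and
$\tilde{G}(\boldsymbol{\rho}_R, \boldsymbol{\rho}_S)$ is the field in the configuration corresponding
to the set of parameters $\{\varepsilon_\alpha\}$. With (5) and (6), the
inverse-scattering problem has been reduced to an optimization
problem for a fixed number of parameters.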
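
In a numerical scheme the double angular integral in the squared error becomes a sum over sampled receiver and source positions. A minimal sketch, assuming uniform sampling of both angles over [0, 2π) and a simple rectangle rule (the function name and the array layout are hypothetical):

```python
import numpy as np

def squared_error(field_model, field_known):
    """Rectangle-rule estimate of the double angular integral of |difference|^2.

    Both arrays have shape (n_receivers, n_sources), sampled uniformly over
    [0, 2*pi) in receiver angle and in source angle.
    """
    n_r, n_s = field_known.shape
    d_phi_r = 2.0 * np.pi / n_r
    d_phi_s = 2.0 * np.pi / n_s
    return float(np.sum(np.abs(field_model - field_known) ** 2) * d_phi_r * d_phi_s)
```

For a model field that differs from the data by a constant offset d at every sample, the function returns (2π)² |d|², independent of the number of samples.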

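Most inversion schemes are based on a linearization of
the cost function (6) around some “best” estimate
$\tilde{\varepsilon}_r(\boldsymbol{\rho}) = \sum_\alpha \tilde{\varepsilon}_\alpha \phi_\alpha(\boldsymbol{\rho})$. Taking $\varepsilon_r^{ref} = \tilde{\varepsilon}_r$ results in the integral
relation:

$$ G(\boldsymbol{\rho}, \boldsymbol{\rho}') = \tilde{G}(\boldsymbol{\rho}, \boldsymbol{\rho}') - s^2 \iint_{\mathcal{D}_0} \big[\varepsilon_r(\boldsymbol{\rho}'') - \tilde{\varepsilon}_r(\boldsymbol{\rho}'')\big]\, \tilde{G}(\boldsymbol{\rho}, \boldsymbol{\rho}'')\, G(\boldsymbol{\rho}'', \boldsymbol{\rho}')\, dA'', \qquad (7) $$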


In this equation, we replace $\varepsilon_r(\boldsymbol{\rho})$ by the parameterized
version (5), and we minimize the cost function (6), augmented
by a regularization term that favours a smooth solution.
This should result in a “better” estimate $\tilde{\varepsilon}_r(\boldsymbol{\rho})$.

Linearized inversion schemes like distorted-wave Born and
Newton-Kantorovitch essentially amount to alternately
determining $\tilde{G}(\boldsymbol{\rho}, \boldsymbol{\rho}_S)$ by the procedure outlined in Section 2,
and $\tilde{\varepsilon}_r(\boldsymbol{\rho})$ as explained above. This procedure gives rise to
two fundamental questions.

•  First, how much information can be retrieved? To
answer this question, we consider a small perturbation
(a “pixel”) in a known configuration, and we use the
distorted-wave Born approximation given in (8) to reconstruct
it in a single step. This step is representative of the
final stage of the iterative procedure. We formally arrive at
a SAR-like processing scheme which shows that the resolution
improves with increasing $\omega\sqrt{\varepsilon_r(\boldsymbol{\rho})}$. This has been
confirmed by implementing the reconstruction scheme for
the special case where the estimated configuration is a
homogeneous circular cylinder.

•  Second, will the scheme converge? The dynamic range
of the procedure depends on the error in the linearization,
i.e., on the magnitude of the term of $O([\varepsilon_r - \tilde{\varepsilon}_r]^2)$ in (8).
For a general configuration, it is hard to give a quantitative
estimate of this “Born error”. Therefore, we have investigated
its behavior from closed-form expressions for homogeneous
circular cylinders. It turns out that the “Born error”
increases with increasing contrast $|\varepsilon_r - \tilde{\varepsilon}_r|$, difference
in object dimension, and frequency $\omega$.

To reconcile these observations, we start with a small $\omega$
to obtain the “trend”. In a few steps, we then increase the
frequency to obtain more and more “detail”, using previous
results as initial estimates for $\tilde{\varepsilon}_r(\boldsymbol{\rho})$.

3.  EXTENSIONS 

With the fast forward solution of Section 2 and the
theoretical analysis of Section 3, linearized inversion schemes
have again become attractive. However, the availability
of the full scattering matrix also offers the possibility to
overcome some of the limitations of the scheme outlined
in Section 3.

•  A first idea is to replace the determination of $\tilde{\varepsilon}_r(\boldsymbol{\rho})$ by
a line search in a nonlinear optimization scheme. Substituting
the linearized equation in the cost function gives the
profile gradient in closed form.

p2n  p27r 

I  I  l^ipRy  Ps)  ~ 

Jo  Jo 

p2n  p2t7 

^  I  /  \G{pr,Ps)  -G[pR,ps)\^dipRdAps  (9) 

Jo  Jo 

{p  p  p2tt  p2t: 

s  /  /  [£(p)  -  e(p)]  /  /  <5(p,  Pr)  X 

J  JVo  VO  ^0 

Gip,  Ps)  [G{Pr,Ps)  -  G{pr,  Ps)]*  d‘Ps  dA\ 


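for $\boldsymbol{\rho}, \boldsymbol{\rho}' \in \mathcal{D}_\infty$. We use the same representation for
$G(\boldsymbol{\rho}'', \boldsymbol{\rho}')$ in the integrand on the right-hand side and use
the reciprocity relation $G(\boldsymbol{\rho}, \boldsymbol{\rho}') = G(\boldsymbol{\rho}', \boldsymbol{\rho})$ to obtain the
approximate equation:

$$ -s^2 \iint_{\mathcal{D}_0} \big[\varepsilon_r(\boldsymbol{\rho}) - \tilde{\varepsilon}_r(\boldsymbol{\rho})\big]\, \tilde{G}(\boldsymbol{\rho}, \boldsymbol{\rho}_R)\, \tilde{G}(\boldsymbol{\rho}, \boldsymbol{\rho}_S)\, dA = G(\boldsymbol{\rho}_R, \boldsymbol{\rho}_S) - \tilde{G}(\boldsymbol{\rho}_R, \boldsymbol{\rho}_S) + O([\varepsilon_r - \tilde{\varepsilon}_r]^2). \qquad (8) $$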


By “marching on in search parameter” we obtain a similar
efficiency of the successive field computations as in the
linearized schemes. This idea has recently been implemented
in combination with the BFGS method [3].

•  A second idea is to consider a more general environment.
The CGFFT method can only be applied to objects
in a homogeneous environment. Practical measurements,
e.g. for biological tissue, must be carried out in a more
complicated set-up like a water-filled metal cylinder. Such
a set-up is presently being developed by the last two
authors (Fig. 2). In that case, a direct computation of the
field is relatively complicated from a computational point
of view. Hence, the idea is to use the procedure outlined
above to determine the complete scattering response of the
estimated object in a homogeneous environment, and
subsequently use a spectral formulation to “embed” this object
in the actual environment [4]. That environment is then
characterized once by experimental and/or computational
techniques.


FIGURE  2:  set-up  for  biological  tissue. 

At  the  conference,  both  ideas  will  be  discussed,  and  the 
first  numerical  results  will  be  presented.  To  this  end,  the 
modified  algorithm  will  be  applied  to  theoretical  as  well  as 
experimental  results. 
1.  ABSTRACT

This paper describes a simple algorithm for reconstructing
the complex index of refraction of a bounded object
immersed in a known background from a knowledge of how
the object scatters known incident radiation. The method
described here is versatile, accommodating both spatially
and frequency varying incident fields and allowing a priori
information about the scatterer to be introduced in a
simple fashion. Numerical results show that this new
algorithm outperforms the modified gradient approach which
until now has been one of the most effective reconstruction
algorithms available.


2.  INTRODUCTION

The problem of reconstructing the complex index of
refraction of a bounded object immersed in a known
background medium, from a knowledge of how the object
scatters known incident acoustic or electromagnetic radiation,
has received a tremendous amount of attention in the past
decade. Almost all reconstruction algorithms rely in some
way upon the Lippmann-Schwinger equation or domain
integral equation for the field inside the scattering object as
well as the related integral representation for the field
outside the object.

The present paper describes a simple algorithm for
reconstructing unknown contrasts which is extremely versatile,
accommodating both spatially and frequency varying incident
fields and allowing for the introduction of a priori
information, such as positivity constraints, in a simple
fashion. The algorithm is a variant of the source type integral
equation (STIE) method introduced by Habashy et al. [11]
on one hand and the modified gradient approach used by
the authors in [12], [13], [14] on the other. Numerical
results show that, despite the simplicity of the algorithm,
it outperforms the modified gradient approach which has
been one of the most effective reconstruction algorithms
available until now [25]. We present here the simplest
version of the algorithm wherein we treat scalar waves in
$\mathbb{R}^2$ for bodies immersed in a homogeneous background.

3.  NOTATION AND PROBLEM STATEMENT

Denote by p and q position vectors in $\mathbb{R}^2$ and let B
denote a bounded, not necessarily connected, scattering
object (or objects) whose location and index of refraction
or contrast is unknown but which is known to lie within
another, larger, bounded simply connected domain D. If
$u_j^{inc}(p)$ denotes an incident wave with
wavenumber $k_j$ (assumed to be real) and source point $q_j$
($q_j$ is replaced by the unit vector $\hat{q}_j$ for plane waves) then
for a large class of scattering problems the total field in D
is known to satisfy the integral equation

$$ u_j(p) = u_j^{inc}(p) + k_j^2 \int_D G_j(p, q)\, \chi_j(q)\, u_j(q)\, dv(q), \qquad (1) $$


where  Gj  (p,  q)  denotes  the  Green  function  of  the  back¬ 
ground  medium, 


Gj{p,q)  ■' 


-Hi  \kj\p-q\) 


(2) 


and 


Xj 


k'^{<l,kj) 


k] 


-1. 


(3) 


We assume that, while the spatial dependence of $\chi_j$ is unknown, the frequency dependence is known, so that, for example, in electromagnetics for a Maxwell medium,

\[ k_j = \omega_j \sqrt{\varepsilon_0 \mu_0} \qquad (4) \]

and

\[ \chi_j(q) = \frac{\varepsilon(q) - \varepsilon_0}{\varepsilon_0} + i\,\frac{\sigma(q)}{\omega_j \varepsilon_0}, \qquad (5) \]

where $\varepsilon_0$ and $\mu_0$ are the permittivity and the permeability of the (lossless) background, while $\varepsilon(q)$ and $\sigma(q)$ are the permittivity and conductivity of the scatterer, which is assumed to be nonmagnetic. Equation (5) may also be written as

\[ \chi_j(q) = \chi^r(q) + i\,\frac{k_0}{k_j}\,\chi^i(q), \qquad (6) \]

where $k_0$ is the wavenumber for $j = 0$. This equation simplifies to

\[ \chi_j(q) = \chi(q) = \chi^r(q) + i\,\chi^i(q) \qquad (7) \]

if all the measurements are made at the same frequency, say at $k_j = k_0$.

Lisbon, Portugal, 21-22 September 1998, and published in RTO EN-2.

Observe that if $p$ is not in $B$ then $\chi_j$ vanishes, but if the location of $B$ is unknown then it is not known a priori where $\chi_j$ vanishes. However, with the assumption that $B \subset D$ it is known that $\chi_j$ vanishes for $p$ outside $D$. In fact, denoting by $S$ a domain (or curve, or a discrete collection of points) outside of $D$ where the scattered field is measured to be $f_j(p)$, (1) becomes

\[ f_j(p) = k_j^2 \int_D G_j(p,q)\,\chi_j(q)\,u_j(q)\,dv(q), \quad p \in S, \qquad (8) \]

if there is no noise or error in the measurements. But error-free data are extremely unlikely and we do not assume that (8) holds exactly. Rewriting (1) and (8) in symbolic form we have the object or state equations

\[ u_j = u_j^{\mathrm{inc}} + G_j^D \chi_j u_j, \quad p \in D, \qquad (9) \]

and the data equations

\[ f_j = G_j^S \chi_j u_j, \quad p \in S; \qquad (10) \]

the superscripts $D$ and $S$ on the operators defined implicitly in (1) and (8) are added to accentuate the location of the point $p$, since the operators are identical in all other respects.

4.  THEORETICAL  BACKGROUND 

In the absence of other a priori information, (9) and (10) are the only equations we have relating the unknown contrast $\chi_j$ (which, recall, consists of at most two unknown real valued functions) and the unknown fields $u_j$ in $D$. The known data consist of the incident fields, $u_j^{\mathrm{inc}}$, the wavenumbers, $k_j$, the measured data, $f_j$, and the test domain $D$. Equations (9) and (10) are linear in each of the unknowns $\chi_j$ and $u_j$, but since both are unknown the problem is in fact mildly nonlinear. Of course the dependence of the fields $u_j$ on the contrast $\chi_j$ is highly non-linear. This may be seen by writing the formal inverse of the state equations (9) as

\[ u_j = (I - G_j^D \chi_j)^{-1} u_j^{\mathrm{inc}}. \qquad (11) \]

This form has been utilized in a number of inversion methods. Introducing it into the data equations we obtain

\[ f_j = G_j^S \left[ \chi_j (I - G_j^D \chi_j)^{-1} u_j^{\mathrm{inc}} \right], \qquad (12) \]

wherein the non-linearity of the inverse problem is clearly exposed. Approximating the inverse operator by

\[ (I - G_j^D \chi_j)^{-1} \approx I \qquad (13) \]

leads to the Born approximation, while in iterative methods, where a sequence $\{\chi_{j,n}\}$ is constructed, the approximation

\[ (I - G_j^D \chi_{j,n})^{-1} \approx (I - G_j^D \chi_{j,n-1})^{-1} \qquad (14) \]

gives rise to the iterative Born method [20], while the linearization of

\[ (I - G_j^D \chi_{j,n})^{-1} = \left[ I - G_j^D \chi_{j,n-1} - G_j^D (\chi_{j,n} - \chi_{j,n-1}) \right]^{-1} \qquad (15) \]

in terms of $\Delta\chi_{j,n} = \chi_{j,n} - \chi_{j,n-1}$, namely,

\[ (I - G_j^D \chi_{j,n})^{-1} \approx \left[ I + (I - G_j^D \chi_{j,n-1})^{-1} G_j^D \Delta\chi_{j,n} \right] (I - G_j^D \chi_{j,n-1})^{-1}, \qquad (16) \]

leads to the Newton-Kantorovich method [17], [19], which has been shown to be equivalent to the distorted Born approach [6]. Observe that at each step in the iteration it is necessary to compute the action of the operator $[I - G_j^D \chi_{j,n-1}]^{-1}$ for known $\chi_{j,n-1}$. This means that forward problems, or direct scattering problems, must be solved at each iterative step.
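The difference between the approximations (14) and (16) can be checked numerically on a small matrix problem. The following is only a sketch: the matrix `G` is a random stand-in for a discretized operator $G_j^D$ (an assumption made for illustration, not a sampled Green function), and the sizes and perturbation level are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)
N = 6  # number of cells in a toy test domain (illustrative choice)

# Random stand-in for the discretized operator G_j^D (assumption).
G = 0.1 * (rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N)))
chi_prev = rng.uniform(0.0, 1.0, N)        # chi_{j,n-1}
d_chi = 1e-3 * rng.uniform(-1.0, 1.0, N)   # Delta chi_{j,n}

I = np.eye(N)
inv_prev = np.linalg.inv(I - G @ np.diag(chi_prev))
exact = np.linalg.inv(I - G @ np.diag(chi_prev + d_chi))

iterative_born = inv_prev                                # approximation (14)
newton = (I + inv_prev @ G @ np.diag(d_chi)) @ inv_prev  # approximation (16)

err_born = np.linalg.norm(exact - iterative_born)   # first order in d_chi
err_newton = np.linalg.norm(exact - newton)         # second order in d_chi
```

In this toy setting the linearized form (16) is markedly closer to the exact inverse than (14), but both require the inverse for the previous contrast, which is the forward-problem cost noted in the text.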

A method which avoids the necessity of solving forward problems completely was proposed by the authors [12] and was refined [13], [14], [25] and extended [24]. This modified gradient method involves the simultaneous construction of sequences $\{u_{j,n}\}$ and $\{\chi_{j,n}\}$ to minimize the error in a cost functional consisting of the normalized errors in both state equations (9) and data equations (10). It has proven to be very effective in a large number of numerical tests using both synthetic and experimental data.

Because the contrast and fields occur as a product, many workers have introduced the quantity

\[ w_j = \chi_j u_j, \qquad (17) \]

which is called a contrast source since $u_j$ satisfies the equation

\[ (\nabla^2 + k_j^2)\,u_j = -k_j^2 w_j \quad \text{in } B. \qquad (18) \]

Then the data equations become

\[ f_j = G_j^S w_j, \qquad (19) \]

while the state equations become

\[ u_j = u_j^{\mathrm{inc}} + G_j^D w_j, \qquad (20) \]

or, with (17),

\[ \chi_j u_j^{\mathrm{inc}} = w_j - \chi_j G_j^D w_j. \qquad (21) \]
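The relations (17) to (21) can be verified on a small discretized example. This is a minimal sketch under stated assumptions: `GD` and `GS` are random matrices standing in for the discretized operators $G_j^D$ and $G_j^S$, and a single excitation is used.

```python
import numpy as np

rng = np.random.default_rng(1)
N, M = 8, 5  # cells in D and receivers on S (toy sizes)

# Random stand-ins for the discretized operators (assumption).
GD = 0.1 * (rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N)))
GS = 0.1 * (rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N)))
chi = rng.uniform(0.0, 1.0, N) + 1j * rng.uniform(0.0, 0.2, N)
u_inc = np.exp(1j * rng.uniform(0.0, 2.0 * np.pi, N))

# Forward problem: solve the state equation (9) for the total field.
u = np.linalg.solve(np.eye(N) - GD @ np.diag(chi), u_inc)
w = chi * u    # contrast source (17)
f = GS @ w     # data equation (19)

res20 = np.linalg.norm(u - (u_inc + GD @ w))                # residual of (20)
res21 = np.linalg.norm(chi * u_inc - (w - chi * (GD @ w)))  # residual of (21)
```

Both residuals vanish to rounding error, which illustrates that (20) and (21) are the same state equation written in terms of the contrast source.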

Equation (19) is called by some a source type integral equation and it has a long history. It is a classic ill-posed equation and for a time there was considerable attention paid to the question of uniqueness, e.g., [1], [8], [2]. It was shown that there exist non-trivial solutions of the homogeneous form of (19), although it was argued by some that uniqueness could be restored from physical considerations. A good summary of the debate is given by Devaney and Sherman [9] and the responses by Bojarski [3] and Stone [18]. It is not our intent to renew this controversy since it is now well accepted that non-trivial solutions of the homogeneous form of (19) exist. Moreover, it has also been shown that the minimum norm solution of (19), the solution produced, for example, by the conjugate gradient method, is not the appropriate physical solution. Nonetheless this source type equation has served as an essential ingredient in many inversion procedures, e.g., [10], [4], [5], [7], [11]. Habashy et al. [11] present an inversion method wherein the minimum norm solution of (19) is found first and then a basis for the orthogonal complement of this solution is constructed, in terms of which the physical solution is sought to satisfy (21). Van den Berg and Haak [22] proposed a variant of this technique wherein the full minimum norm solution is not found but rather it is sought iteratively, using conjugate gradient steps, with the contrast updated at each step to satisfy (21) and a new source defined through (17). This approach yielded promising numerical results; however, the error did not decrease monotonically.

In the present paper we propose a method which combines the spirit of the approach of Van den Berg and Haak [22], using the source type integral equation, with that of the modified gradient approach by seeking linear updates in the source in an error reducing method which does not require the solution of any forward problem.


5.  CONTRAST  SOURCE  INVERSION  METHOD 

As in the modified gradient as well as the Van den Berg-Haak approach we simultaneously construct sequences of sources $w_{j,n}$, fields $u_{j,n}$ and contrasts $\chi_{j,n}$ to minimize a cost functional. Rather than choosing a cost functional consisting only of errors in the data equation, as Van den Berg and Haak did, we define the cost functional

\[ F(w_j, \chi_j) = \frac{\sum_j \| f_j - G_j^S w_j \|_S^2}{\sum_j \| f_j \|_S^2} + \frac{\sum_j \| \chi_j u_j^{\mathrm{inc}} - w_j + \chi_j G_j^D w_j \|_D^2}{\sum_j \| \chi_j u_j^{\mathrm{inc}} \|_D^2}, \qquad (22) \]

where $\|\cdot\|_S$ and $\|\cdot\|_D$ denote the norms on $L^2(S)$ and $L^2(D)$, respectively. The normalization is chosen so that both terms are equal to one if $w_j = 0$. The first term measures the error in the data equations and the second term measures the error in the form of the state equations given in (21). This is a quadratic functional in $w_j$, but highly nonlinear in $\chi_j$. We propose an iterative minimization of this cost functional using an alternating method which first updates $w_j$ and then updates $\chi_j$. Thus we construct sequences $\{w_{j,n}\}$ and $\{\chi_{j,n}\}$, for $n = 0, 1, 2, \ldots$, in the following manner.
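As an illustration of the normalization in (22), the sketch below evaluates the two terms of the cost functional for a toy discretized problem. The operators are random stand-ins (an assumption), a single frequency is assumed so that the contrast and the operators do not depend on $j$, and the function name `csi_cost` is hypothetical.

```python
import numpy as np

def csi_cost(w, chi, f, u_inc, GS, GD):
    """Return (data term, state term) of the cost functional (22).

    w, f and u_inc are lists holding one complex array per excitation j.
    """
    data_num = sum(np.linalg.norm(fj - GS @ wj) ** 2 for fj, wj in zip(f, w))
    data_den = sum(np.linalg.norm(fj) ** 2 for fj in f)
    state_num = sum(np.linalg.norm(chi * uj - wj + chi * (GD @ wj)) ** 2
                    for uj, wj in zip(u_inc, w))
    state_den = sum(np.linalg.norm(chi * uj) ** 2 for uj in u_inc)
    return data_num / data_den, state_num / state_den

# Toy problem with random stand-ins for the discretized operators.
rng = np.random.default_rng(2)
N, M, J = 8, 5, 3
GD = 0.1 * (rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N)))
GS = 0.1 * (rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N)))
chi = rng.uniform(0.2, 1.0, N)
u_inc = [np.exp(1j * rng.uniform(0.0, 2.0 * np.pi, N)) for _ in range(J)]
u = [np.linalg.solve(np.eye(N) - GD @ np.diag(chi), ui) for ui in u_inc]
w_true = [chi * uj for uj in u]          # exact contrast sources
f = [GS @ wj for wj in w_true]           # noise-free synthetic data

zero = [np.zeros(N, dtype=complex) for _ in range(J)]
terms_at_zero = csi_cost(zero, chi, f, u_inc, GS, GD)
terms_at_truth = csi_cost(w_true, chi, f, u_inc, GS, GD)
```

With zero sources both terms equal one, and with the exact sources and contrast both terms vanish, in line with the statement about the normalization.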

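Define the data error to be

\[ \rho_{j,n} = f_j - G_j^S w_{j,n}, \qquad (23) \]

and the state error to be

\[ r_{j,n} = \chi_{j,n} u_{j,n} - w_{j,n}, \qquad (24) \]

where

\[ u_{j,n} = u_j^{\mathrm{inc}} + G_j^D w_{j,n}. \qquad (25) \]

Now suppose $w_{j,n-1}$ and $\chi_{j,n-1}$ are known. We update $w_j$ by

\[ w_{j,n} = w_{j,n-1} + \alpha_{j,n} v_{j,n}, \qquad (26) \]

where $\alpha_{j,n}$ is constant and the update directions $v_{j,n}$ are functions of position.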

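The update directions are chosen to be the Polak-Ribière conjugate gradient directions

\[ v_{j,0} = 0, \qquad v_{j,n} = -g_{j,n} + \frac{\langle g_{j,n},\, g_{j,n} - g_{j,n-1} \rangle_D}{\langle g_{j,n-1},\, g_{j,n-1} \rangle_D}\, v_{j,n-1}, \quad n \geq 1, \qquad (27) \]

where $g_{j,n}$ is the gradient (Fréchet derivative) of the cost functional with respect to $w_j$ evaluated at $w_{j,n-1}$, $\chi_{j,n-1}$, while $\langle \cdot, \cdot \rangle_D$ denotes the inner product on $L^2(D)$. Explicitly this is found to be

\[ g_{j,n} = -\frac{G_j^{S*} \rho_{j,n-1}}{\sum_k \| f_k \|_S^2} - \frac{r_{j,n-1} - G_j^{D*} \left( \bar{\chi}_{j,n-1}\, r_{j,n-1} \right)}{\sum_k \| \chi_{k,n-1} u_k^{\mathrm{inc}} \|_D^2}, \qquad (28) \]

where $G_j^{S*}$ and $G_j^{D*}$ are the adjoints of $G_j^S$ and $G_j^D$, mapping $L^2(S)$ into $L^2(D)$ and $L^2(D)$ into $L^2(D)$, respectively. Further the overbar denotes complex conjugate.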
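With the update directions completely specified the constant $\alpha_{j,n}$ is determined to minimize the cost functional

\[ F_n = \frac{\sum_j \| f_j - G_j^S w_{j,n} \|_S^2}{\sum_j \| f_j \|_S^2} + \frac{\sum_j \| \chi_{j,n-1} u_{j,n} - w_{j,n} \|_D^2}{\sum_j \| \chi_{j,n-1} u_j^{\mathrm{inc}} \|_D^2} \qquad (29) \]

and is found explicitly to be

\[ \alpha_{j,n} = \frac{a + b}{c + d}, \]
\[ a = \frac{\langle \rho_{j,n-1},\, G_j^S v_{j,n} \rangle_S}{\sum_k \| f_k \|_S^2}, \qquad b = \frac{\langle r_{j,n-1},\, v_{j,n} - \chi_{j,n-1} G_j^D v_{j,n} \rangle_D}{\sum_k \| \chi_{k,n-1} u_k^{\mathrm{inc}} \|_D^2}, \]
\[ c = \frac{\| G_j^S v_{j,n} \|_S^2}{\sum_k \| f_k \|_S^2}, \qquad d = \frac{\| v_{j,n} - \chi_{j,n-1} G_j^D v_{j,n} \|_D^2}{\sum_k \| \chi_{k,n-1} u_k^{\mathrm{inc}} \|_D^2}, \qquad (30) \]

where $\langle \cdot, \cdot \rangle_S$ denotes the inner product on $L^2(S)$.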
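A single source update can be sketched for one excitation as follows. The sketch assumes random matrices in place of the discretized operators, plain Euclidean sums in place of the $L^2$ inner products, adjoints given by conjugate transposes, and only the first step of the conjugate gradient recursion (so that the direction is the negative gradient); all function names are hypothetical.

```python
import numpy as np

def inner(x, y):
    """Inner product <x, y> = sum(x * conj(y)), linear in the first argument."""
    return np.vdot(y, x)

def cost(w, chi, f, u_inc, GS, GD, eta_S, eta_D):
    """Normalized data error plus state error for fixed contrast chi."""
    rho = f - GS @ w
    r = chi * (u_inc + GD @ w) - w
    return eta_S * np.linalg.norm(rho) ** 2 + eta_D * np.linalg.norm(r) ** 2

def first_source_update(w, chi, f, u_inc, GS, GD, eta_S, eta_D):
    """One update w + alpha * v with v = -g (first conjugate gradient step)."""
    rho = f - GS @ w                          # data error
    r = chi * (u_inc + GD @ w) - w            # state error
    g = (-eta_S * (GS.conj().T @ rho)
         - eta_D * (r - GD.conj().T @ (np.conj(chi) * r)))  # gradient
    v = -g
    Gv = GS @ v
    Av = v - chi * (GD @ v)
    alpha = ((eta_S * inner(rho, Gv) + eta_D * inner(r, Av))
             / (eta_S * np.linalg.norm(Gv) ** 2 + eta_D * np.linalg.norm(Av) ** 2))
    return w + alpha * v

rng = np.random.default_rng(3)
N, M = 8, 5
GD = 0.1 * (rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N)))
GS = 0.1 * (rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N)))
chi_prev = rng.uniform(0.2, 1.0, N)           # current contrast estimate
u_inc = np.exp(1j * rng.uniform(0.0, 2.0 * np.pi, N))
f = rng.standard_normal(M) + 1j * rng.standard_normal(M)
w0 = GS.conj().T @ f                          # some nonzero starting source
eta_S = 1.0 / np.linalg.norm(f) ** 2
eta_D = 1.0 / np.linalg.norm(chi_prev * u_inc) ** 2

w1 = first_source_update(w0, chi_prev, f, u_inc, GS, GD, eta_S, eta_D)
F0 = cost(w0, chi_prev, f, u_inc, GS, GD, eta_S, eta_D)
F1 = cost(w1, chi_prev, f, u_inc, GS, GD, eta_S, eta_D)
```

Because the step length minimizes a quadratic exactly along the chosen direction, the cost after the update cannot exceed the cost before it.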

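Once $w_{j,n}$ is determined, $u_{j,n}$ is obtained via (25) and (26) as

\[ u_{j,n} = u_{j,n-1} + \alpha_{j,n} G_j^D v_{j,n} \qquad (31) \]

and we then seek $\chi_j$ to minimize the cost functional

\[ F_D = \frac{\sum_j \| \chi_j u_{j,n} - w_{j,n} \|_D^2}{\sum_j \| \chi_j u_j^{\mathrm{inc}} \|_D^2}. \qquad (32) \]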


Since this minimization is not so easy, especially in the case of a priori information, we use a minimization in two steps. First we minimize the much simpler cost functional

\[ F'_D = \sum_j \| \chi_j u_{j,n} - w_{j,n} \|_D^2, \qquad (33) \]

and then we use the found contrast as an optimization direction in a line minimization to minimize (32). This two-step minimization technique guarantees that the process is always error reducing and allows for easy implementation of a priori information or constraints on $\chi_j$. Since we finally need to minimize the cost functional of (32), we define the contrast function that minimizes (33), $\tilde{\chi}_j$, through

\[ F'_D(\tilde{\chi}_j) = \min(F'_D). \qquad (34) \]

Restricting attention to inhomogeneities complying with the Maxwell model, (6), and in the absence of any a priori information on $\chi$, we find that, [23], $F'_D$ is minimized by choosing

\[ \tilde{\chi}_n^r = \frac{\sum_j \mathrm{Re}\left( w_{j,n} \bar{u}_{j,n} \right)}{\sum_j |u_{j,n}|^2}, \qquad \tilde{\chi}_n^i = \frac{\sum_j \frac{k_0}{k_j}\, \mathrm{Im}\left( w_{j,n} \bar{u}_{j,n} \right)}{\sum_j \left( \frac{k_0}{k_j} \right)^2 |u_{j,n}|^2}. \qquad (35) \]
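The pointwise character of (35) can be made explicit in a few lines. This sketch assumes that the contrast follows (6) with frequency weights $k_0/k_j$ passed in as the array `ratio`; the function name and array layout are illustrative choices.

```python
import numpy as np

def contrast_update(w, u, ratio):
    """Evaluate (35) cell by cell.

    w, u  : complex arrays of shape (J, N), one row per excitation j
    ratio : real array of shape (J,) holding k_0 / k_j
    """
    wu = w * np.conj(u)
    u2 = np.abs(u) ** 2
    c = ratio[:, None]
    chi_r = wu.real.sum(axis=0) / u2.sum(axis=0)
    chi_i = (c * wu.imag).sum(axis=0) / (c ** 2 * u2).sum(axis=0)
    return chi_r, chi_i

# Consistency check: build sources from a known contrast and recover it.
rng = np.random.default_rng(4)
J, N = 4, 10
ratio = np.array([1.0, 0.8, 0.6, 0.5])
chi_r_true = rng.uniform(0.0, 2.0, N)
chi_i_true = rng.uniform(0.0, 0.5, N)
u = rng.standard_normal((J, N)) + 1j * rng.standard_normal((J, N))
w = (chi_r_true[None, :] + 1j * ratio[:, None] * chi_i_true[None, :]) * u

chi_r, chi_i = contrast_update(w, u, ratio)
```

When the sources and fields are exactly consistent with a contrast of the form (6), the update returns that contrast; with inconsistent data it returns the least squares fit in each cell.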
However, if a priori information is available then it is relatively simple to incorporate it in choosing $\chi_n$. If either $\chi^r$ or $\chi^i$ is known, then we merely use this known value in place of either the first equality or second equality of (35). Thus, for example, if $\chi^i = 0$, we limit our reconstruction procedure to $\chi_n^r$ from the outset. If we have a priori information that $\chi^r$ and $\chi^i$ are positive, we use, [23]


(37)


These choices of $\chi_n^r$ and $\chi_n^i$ coincide with those obtained by Kohn and McKenney [15] for an optimization problem with a positivity constraint and employed in the modified gradient algorithm [14] with good results.

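Next a line minimization is used to ensure that the cost functional of equation (32) is reduced. We introduce a contrast update direction as

\[ d_n = \tilde{\chi}_n - \chi_{n-1} \qquad (38) \]

and we write $\chi_n$ as

\[ \chi_n = \chi_{n-1} + \theta d_n. \qquad (39) \]

Then $\theta$ is chosen to minimize the cost functional of equation (32),

\[ F_D = \frac{\sum_j \| \chi_{n-1} u_{j,n} - w_{j,n} + \theta (\tilde{\chi}_n - \chi_{n-1}) u_{j,n} \|_D^2}{\sum_j \| \chi_{n-1} u_j^{\mathrm{inc}} + \theta (\tilde{\chi}_n - \chi_{n-1}) u_j^{\mathrm{inc}} \|_D^2} = \frac{a\theta^2 + 2b\theta + c}{A\theta^2 + 2B\theta + C}, \qquad (40) \]

where

\[ a = \sum_j \| (\tilde{\chi}_n - \chi_{n-1}) u_{j,n} \|_D^2, \]
\[ b = \mathrm{Re} \sum_j \langle \chi_{n-1} u_{j,n} - w_{j,n},\, (\tilde{\chi}_n - \chi_{n-1}) u_{j,n} \rangle_D, \]
\[ c = \sum_j \| \chi_{n-1} u_{j,n} - w_{j,n} \|_D^2, \]
\[ A = \sum_j \| (\tilde{\chi}_n - \chi_{n-1}) u_j^{\mathrm{inc}} \|_D^2, \]
\[ B = \mathrm{Re} \sum_j \langle \chi_{n-1} u_j^{\mathrm{inc}},\, (\tilde{\chi}_n - \chi_{n-1}) u_j^{\mathrm{inc}} \rangle_D, \]
\[ C = \sum_j \| \chi_{n-1} u_j^{\mathrm{inc}} \|_D^2. \qquad (41) \]

This is the quotient of two quadratics which, using elementary analysis, may be shown to attain its minimum when

\[ \theta = \frac{-(aC - Ac) + \sqrt{(aC - Ac)^2 - 4 (aB - Ab)(bC - Bc)}}{2 (aB - Ab)}. \qquad (42) \]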

This completes the description of the algorithm except for designating the starting values $w_{j,0}$. Observe that we cannot start with $w_{j,0} = 0$ since then $\chi_0 = \tilde{\chi}_0 = 0$ and the cost functional (29) is undefined for $n = 1$. Therefore we choose as starting values either the constant values that minimize the data error,

\[ w_{j,0} = \frac{\langle f_j,\, G_j^S 1 \rangle_S}{\| G_j^S 1 \|_S^2}, \qquad (43) \]

or the values obtained by backpropagation,

\[ w_{j,0}^{\mathrm{bp}} = \frac{\| G_j^{S*} f_j \|_D^2}{\| G_j^S G_j^{S*} f_j \|_S^2}\, G_j^{S*} f_j. \qquad (44) \]

This completes the description of the algorithm.
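The backpropagation starting value amounts to a single steepest descent step on the data error starting from zero. A minimal sketch, assuming a random matrix `GS` in place of the discretized data operator and Euclidean norms in place of the $L^2$ norms:

```python
import numpy as np

def backpropagation(f, GS):
    """Scaled adjoint applied to the data: the scale minimizes ||f - GS w||."""
    back = GS.conj().T @ f
    scale = np.linalg.norm(back) ** 2 / np.linalg.norm(GS @ back) ** 2
    return scale * back

rng = np.random.default_rng(5)
M, N = 6, 12
GS = rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N))
f = rng.standard_normal(M) + 1j * rng.standard_normal(M)

w0 = backpropagation(f, GS)
residual_zero = np.linalg.norm(f)             # data error for w = 0
residual_bp = np.linalg.norm(f - GS @ w0)     # data error for the starting value
```

The starting value is nonzero whenever the data are not annihilated by the adjoint operator, and it always lowers the data error relative to the zero source.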


6.  COMPUTATIONAL  TESTS 

A number of tests have been done with the algorithm, including stability tests, resolution tests and tests for various kinds of contrast profiles [23]. Here some tests are presented for using the method for reconstructing AP-mines. We used one frequency of 500 MHz, a relative background permittivity of $\varepsilon_r = 5$ and a background conductivity of $\sigma = 0$ S/m. Further we used a computational domain of 29 × 29 cm. The AP-mines were given a relative permittivity of $\varepsilon_r = 7$ and a conductivity of $\sigma = 0$ S/m. Their diameter varied between 5 cm and 10 cm. The measurement curve, $S$, is a circle of radius 1 m with its center at the center of $D$. The discrete form of the algorithm is obtained by dividing $D$ into 29 × 29 subsquares and assuming that the contrast, sources and fields are piecewise constant; the integrals over subsquares were approximated by integrals over circles of equal area, which were calculated analytically [16]. The discrete spatial convolutions of the operators $G_j^D$ and $G_j^{D*}$ were computed using FFT routines [21]. The incident fields were chosen to be excited by line sources parallel to the axis of the scatterer. These sources were taken to be equally spaced on the measurement circle, and the source locations were also chosen as discretization points on the circle. All integrals on $S$ were approximated by point collocation at the discretization points, that is, the rectangular rule with the integrand evaluated at the mid-points. The measured data were simulated by solving the direct scattering problem with a conjugate gradient method [21]. The circle $S$ was subdivided into 30 equally spaced arcs. Each mid-point served as the location of a line source and all the mid-points served as receivers. In all tests backpropagation has been used for the initial guess.

In figure 1, we show the original contrast profile of a circular mine with a 7 cm diameter.


Fig 1: The original profile

In figure 2, we show the reconstruction of the contrast profile of the mine after 4 and after 64 iterations. Continued iteration provided no noticeable improvement.

The conductivity of the mines is zero. We can therefore set the imaginary part of the contrast equal to zero. Since tests indicated that this restriction does not improve the algorithm, this restriction has been left out. Furthermore, since we used a permittivity of the mines which is higher than that of the background, we could use positivity for the mine permittivity. In figure 3, we show the reconstructed contrast profile after 4 iterations using CSI with a priori information, (a), and after 64 iterations with a priori information, (b). Each iteration took approximately 5 seconds on a Pentium PC.

Fig 2: The reconstructions after 4 iterations (a), and after 64 iterations (b), using CSI without a priori information.

Fig 3: The reconstructions after 4 iterations (a), and after 64 iterations (b), using CSI with a priori information.

Fig 5: The reconstructions after 4 iterations (a), and after 64 iterations (b), using CSI without a priori information.

Fig 6: The reconstructions after 4 iterations (a), and after 64 iterations (b), using CSI with a priori information.

Next we have put two mines in the configuration, one circular mine having a diameter of 5 cm and one having a diameter of 10 cm. We have done the same tests. The results are in figures 4, 5 and 6. It is observed that the resolution in these figures is not as good as the resolution in the figures of the original paper. This is due to the size of the domain, which is about $1\lambda \times 1\lambda$. Higher frequencies, thus shorter wavelengths, will lead to an improved resolution but these waves have a lower penetration depth.
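The size of the test domain in wavelengths follows directly from the stated frequency and background permittivity. The short calculation below is a sketch that assumes a lossless, nonmagnetic background, as in the tests described above.

```python
import math

C0 = 299_792_458.0  # speed of light in vacuum, m/s

def wavelength(freq_hz, eps_r):
    """Wavelength in a lossless, nonmagnetic medium of relative permittivity eps_r."""
    return C0 / (freq_hz * math.sqrt(eps_r))

lam = wavelength(500e6, 5.0)          # background wavelength in metres
domain_in_wavelengths = 0.29 / lam    # side of the 29 cm test domain
cell_in_wavelengths = 0.01 / lam      # side of one of the 29 x 29 subsquares
```

The background wavelength is roughly 27 cm, so the 29 cm domain spans only about one wavelength per side while each 1 cm subsquare is a small fraction of a wavelength.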

7.  CONCLUSION 

In this paper we have presented a new inversion algorithm (CSI) for profile reconstruction in acoustics and electromagnetics. The algorithm is based on the source type integral equation which relates measured data to a source distribution in the scattering object. The algorithm is akin to what Kohn and McKenney [15] call an alternating direction implicit (ADI) method wherein two sequences, one of sources and one of contrasts, are reconstructed iteratively by alternately updating the sources and the contrasts. Unlike the Kohn-McKenney method and other approaches based on the source type integral equation, e.g., Habashy et al. [11], the CSI algorithm does not involve completely solving the source type equation for each updated contrast. Similar to the modified gradient method, in each iteration there is no full inversion of the state equations involved.

A cost functional is defined consisting of errors in the source type equations and the state equations. The updates in the sources are found as a conjugate gradient step, after which the contrast is updated by minimizing the error in the state equations, which can be done very simply. The source updates are similar in spirit to those used in the modified gradient method while the contrast updates are found in a simple fashion in which a priori information is easily included. A number of numerical tests indicate that this new algorithm exhibits the best features of the modified gradient algorithm, successfully reconstructing a variety of contrasts and being fairly insensitive to noise. However, the new algorithm exhibits additional properties which surpass the modified gradient approach. It is faster, requires less memory as well as less data and more easily accommodates a priori information.

To give some idea of the computational complexity, if $J$ denotes the number of excitations and $N$ denotes the number of subdomains in the test domain, then the time required for each iteration is roughly $2J$ times the time for one step in the conjugate gradient solution of the forward problem for one excitation, with a memory requirement of approximately bJN x 16 bytes (complex double precision). The time required for each iteration is roughly one third that needed in the modified gradient algorithm with no a priori information on the contrast and is an order of magnitude faster if positivity is included for both real and imaginary parts.

No tests have yet been carried out on the effect of additional regularizers such as total variation, which proved effective for the modified gradient algorithm [24]. This is one of the subjects for future work. The simplicity, speed and reduced memory requirements offer hope that this technique will provide a feasible approach to three-dimensional inversion problems.
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Summary 

Location  and  identification  of  buried  land 
mines  is  a  real  challenge  for  sensor 
technology  and  target  identification 
algorithms.  We  analyse  the  performance  of 
a  bistatic  microwave  imaging  system  with  a 
focused  synthetic  aperture.  If  the  soil  is 
homogeneous  and  dry  with  a  very  smooth 
surface  it  will  be  possible  to  identify  even 
plastic  mines  under  a  10  cm  overburden. 
However,  under  a  rough  surface  or  in  wet 
soil  even  a  relatively  big  metallic  anti-tank 
mine  could  be  missed,  since  the  signal  to 
clutter  ratio  gets  quite  poor  under  these 
circumstances. 

1.  Introduction 

Military conflicts have left vast tracts of country
under the suspicion of contamination by
land mines. Efficient, reliable mine
detection, identification and clearing
techniques are urgently needed. Buried land
mines present a particularly hard problem.
Standard detection procedures for buried
land mines still rely on metal detectors and
bayonet probing. Brute force clearing uses
deep plowing, moulding and harrowing.
Obviously, these procedures can be applied
only on limited areas.

The buried mines range from small,
inexpensive anti-personnel plastic mines with
very little metal content to bulky metallic
anti-tank mines. Possible mine shapes
include disks, cylinders, square boxes and
more elaborate geometries. The sizes of mines
start at about 5 cm and may go up to 40
cm. Looking from above ground, it is
virtually impossible to find a unique
signature that can be used to discriminate a
buried mine from other similar looking
pieces of scrap. A low false alarm rate,
however, is required for an efficient
clearing of extended areas suspected of
containing mines.

The requirements on civilian mine clearing
are quite ambitious:

•  Detection  rate  of  99.9%  or  better, 

•  Operating  at  rough  soil  surface  as  well 
as  for  wet  soil, 

•  Low  false  alarm  rate,  i.e.  clear 
distinction  of  mines  versus  scrap,  rock 
pieces,  soil  surface  structures. 

It has been suggested to exploit the explosive
charge which is contained in every mine to
detect mines under a shallow soil
overburden. Almost all explosive charges
consist of strongly nitrogenous chemicals.
Therefore a high local nitrogen
concentration in the soil may indicate a
buried mine. The proposals for spotting
local nitrogen concentrations close to the
soil surface require varying technical
expenditure. Sniffing by specially trained
dogs and γ-ray activation of the natural
nitrogen isotope with subsequent detection
of the resulting unstable isotope are two
examples that have been suggested [1].
Another starting point for the detection and
identification of buried mines hopes to
exploit the casing geometry of buried mines
as seen by microwaves. An aspect
independent way of perceiving the
geometry of a target is given by the
complex frequency poles of the electromagnetic
scattering amplitude caused by a
buried target if it gets illuminated by
microwaves [2]. If one has a catalogue of these
complex frequency pole patterns for all
interesting buried targets one may hope to
determine the target from an analysis of
microwave scattering data in the relevant
frequency band.

One difficulty with this method when
applied to targets buried in the soil is the
varying influence of the soil overburden on
the received scattering signal. This is in
contrast to the better known application of
this identification technique to airborne
targets. The influence of the air on the back
scattering data is much smaller and
reasonably well under control. The effect of
soil on the complex resonance frequencies of
a buried target varies with the thickness and
composition of the soil overburden.
Furthermore in the S-band, which is most
relevant for mine identification, wet soil is
expected to produce a strong signal
dispersion, which may disturb the final
complex pole analysis of the scattering
data.

In this paper we analyse narrow band
microwave imaging of buried objects as
inspired by much employed high resolution
radar techniques [3,4]. The objective is to
investigate the performance of microwave
imaging for mine searching and
identification, exhausting the physical
capabilities of microwaves without caring
much about implementation details.

2.  Bistatic  Microwave  Imaging  with 
Focussing  Synthetic  Aperture 

In order to obtain a good spatial resolution
for buried targets the wavelength used for
microwave imaging should be chosen as
small as possible. Since a mine buried under
a 20 cm thick layer of moderately wet soil
should be still visible, the microwave
should be able to cross a 1 cm layer of
water without too much absorption.
Looking at the absorption coefficient of
water as a function of frequency one learns
that one cannot go beyond S-band
frequencies. Therefore spatial resolution for
ground penetrating imaging systems based
on non-ionizing electromagnetic waves
cannot go much below 4 to 6 cm. All
numerical studies in this paper use a
frequency of 3.5 GHz.

In order to look into the soil one has to
overcome the reflection from the soil
surface which is superimposed on the
scattering echoes from buried objects. For
vertical illumination a soil surface produces
a reflection factor $r$, given by

\[ r = \frac{n - 1}{n + 1}, \qquad (1) \]

where $n$ is the index of refraction of the
soil. This means that for $n = 3$ half of the
incident wave amplitude would be reflected
at the ground surface and would interfere
with the echo from buried objects. Since
the radar range resolution is not sufficient
to separate the ground surface echo from
that of shallowly buried targets,
suppression of the ground surface echo is
required.

The reflection from a plane surface is
minimal (ideally zero) if the plane is
illuminated along the Brewster direction by
a wave which is polarized in the plane of
incidence. The Brewster angle $\theta_B$ is
determined by the index of refraction of the
soil:

\[ \tan\theta_B = n. \qquad (2) \]

For soil this angle lies in the range from
52° to 85°. If the illumination misses the
Brewster angle by 3° the reflection factor
will be still less than 10%.
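The two reflection statements can be reproduced for the example value $n = 3$ with the standard Fresnel amplitude formula for a wave polarized in the plane of incidence. The sketch below assumes a lossless soil with real index of refraction and a perfectly flat air-soil interface; the function names are illustrative.

```python
import math

def normal_reflection(n):
    """Amplitude reflection factor for vertical illumination, equation (1)."""
    return (n - 1.0) / (n + 1.0)

def brewster_angle_deg(n):
    """Brewster angle in degrees from tan(theta_B) = n, equation (2)."""
    return math.degrees(math.atan(n))

def parallel_reflection(n, theta_deg):
    """Fresnel amplitude reflection factor, polarization in the plane of
    incidence, for a wave going from air into a lossless medium of index n."""
    t = math.radians(theta_deg)
    root = math.sqrt(n * n - math.sin(t) ** 2)
    return (n * n * math.cos(t) - root) / (n * n * math.cos(t) + root)

n = 3.0
theta_b = brewster_angle_deg(n)
r_normal = normal_reflection(n)
r_brewster = parallel_reflection(n, theta_b)
r_low = parallel_reflection(n, theta_b - 3.0)
r_high = parallel_reflection(n, theta_b + 3.0)
```

For $n = 3$ the Brewster angle is about 71.6°, the reflection vanishes there to rounding error, and a 3° mismatch on either side leaves an amplitude reflection factor below 10%, compared with one half at vertical incidence.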


Fig. 1: Bistatic microwave mine detection
and imaging system (the drawing labels the terrain surface)


The spatial resolution increases with the
aperture dimension of the receiving antenna
and decreases with the distance from the
antenna to the focussed region. Hence, in
order to achieve a high resolution power
for the microwave imaging system the
receiving aperture should be as close to
ground as possible. Practical operation
conditions require a sufficient terrain
clearance. In our study we assume that the
receiving antenna is 0.7 m above average
ground level. The receiving antenna is
designed as an approximately 1 m long
linear dipole array. A much bigger array
would restrict the possible use of the mine
searching system which could be mounted
in front of a small cross-country vehicle.

The  resolution  in  the  direction  orthogonal 
to  the  linear  array  is  produced  by  the 
forward  move  of  the  complete  imaging 
system.  The  receiving  array  sweeps  a  two 
dimensional  ground  strip,  thereby  forming  a 
synthetic  aperture,  much  like  in  a  forward 
looking  SAR-system.  At  every  step,  the 
received  complex  scattering  amplitude  as 
seen  by  the  individual  dipole  elements  of 
the  receiving  array  will  be  stored  in  the 
memory  of  a  data  processing  unit. 

The  stepping  direction  will  be  called 
‘longitudinal’  whereas  the  orthogonal 
direction,  parallel  to  the  linear  receiving 
antenna,  will  be  addressed  as  ‘lateral’. 

The longitudinal length of the synthetic
aperture, i.e. the number of real antenna
positions used to compose an image of a
point in real space, determines the
resolution of the system in the longitudinal
direction. The physical length of the array,
i.e. the lateral width of the synthetic aperture,
controls the lateral resolution.

Since  the  microwave  scattering  is 
concentrated  around  the  reflection  angle, 
an  aperture  length  in  stepping  direction  of 
the  order  of  1  m  would  be  sufficient. 
Larger  aperture  dimension  would  improve 
only  marginally  the  resolution  power  of  the 
system  at  the  expense  of  a  longer  image 
processing  time. 

Figure 1 illustrates the basic system design.
Illuminating and receiving antenna are kept
at a rigid horizontal distance from each
other. In our study this distance is 2 m. The
complete system steps forward in the
direction from the receiving array to the
illuminating horn. The receiving array is
oriented laterally. The horn and the dipole
elements of the receiving array are both
inclined by the angle $\theta_B$ against the vertical.


[Figure 2 labels: stepping direction (arrow); receiving array
positions 1 to 4; illuminating horn positions 1 to 4; target plane]

Fig. 2: Synthetic aperture with 4 real antenna positions.

Figure 2 sketches the bistatic system in action, forming a
synthetic plane aperture which consists of 4 real antenna
positions.

3.  Longitudinal  and  Lateral  Resolution 
Power 

The processing of the stored scattering data consists in focussing
the data relative to a grid of focus points which is projected onto
the presumed mine laying plane (target plane). For a chosen focus
point in the target plane one calculates the optical path length
from the illuminating antenna via that focus point to the
individual antenna elements in the receiving aperture. The complex
scattering amplitudes actually received by the elements of the
synthetic aperture are multiplied by the complex conjugate phases
related to the path length from the illuminating antenna to the
receiving element via the chosen focal point on the target plane.
The resulting phase-corrected amplitudes are summed over all
elements of the synthetic aperture. For a target that sits directly
at a focus point all phase-corrected amplitudes of the individual
elements in the synthetic aperture add up, whereas the
phase-corrected elementwise scattering amplitudes of a target which
is more than a resolution distance away from that focus point
cancel each other almost completely in the summed antenna response.

This focussing is performed one by one for all focus points of the
grid. By mapping the distribution of the focussed antenna response
onto the target plane one obtains an image of all scattering
objects which are buried within a certain layer around the target
plane. The thickness of this layer is determined formally by the
depth of field of the imaging system. As we shall see later, it is
not the formal depth of field but the much shorter electromagnetic
penetration depth which actually limits the vision into the soil.

Mathematically the focussing algorithm is defined as follows. Let
A(m,n) be the complex scattering amplitude actually received by the
m-th element of the array at the n-th step position over the target
plane. Furthermore let (x, y) be the coordinates of a chosen focus
point on the target plane. Then the total amplitude received by the
focussed aperture is given by:

  A_F(x,y) = \sum_{m,n} A(m,n) \exp\{ -jk [ \sqrt{h^2 + (x_n - x)^2 + (y_m - y)^2}
                                          + \sqrt{h^2 + y^2 + (x_n + 2a - x)^2} ] \}    (3)

  h    distance of the target plane from the aperture plane,
  k    wave number 2\pi f / c (f = frequency, c = speed of light),
  a    half distance between illuminating horn and receiving linear
       array,
  x_n  longitudinal position of the receiving antenna at the n-th
       step,
  y_m  lateral distance of the m-th array element from the array
       line center.

The height h and the distance a are related by the angle of
incidence θ_B:

  \cot\theta_B = h / a .    (4)

The  focussing  is  achieved  by  compensating 
the  phases  as  actually  received  by  the 
different  antenna  elements  by  the  path 
lengths  of  the  ray  pencil  which  passes  the 
chosen  focus  point. 

The summation in (3) runs over all elements m of the real linear
array and all positions n which form the synthetic aperture. Note
that the focus point belongs to the Fresnel region of the receiving
antenna. The distance from the receiving antenna to the focus point
is not much larger than the longitudinal and lateral aperture
dimensions.

Formula (3) neglects the fact that the focus point is below ground.
The correct electromagnetic path from the transmitting horn to the
receiving array antenna via the focus point would have to take
refraction into account. For a shallow overburden this would cause
a small path length correction. In the numerical simulation, which
we report later on, the full 'optical' path length has been
computed and used for the focussing. Unfortunately the explicit
expression for the full optical path length is rather bulky,
therefore we stay with the simplified formula (3).

One may estimate the spatial resolution of the synthetic aperture
following a popular λ/2 argument in optics and antenna theory:
First compute the path length for the so-called central ray. The
central ray starts at the wave source, scatters off the target
point and ends at the centre of the receiving aperture. Next
compute the path length for the so-called boundary ray. The
boundary ray shares with the central ray the same starting and
scattering points but it ends at the boundary of the receiving
aperture. If both path lengths differ by half a wavelength, then
the scattering signal caused by that target point cancels in the
received total antenna signal.

For a focussed array antenna it is the relative path length which
enters the preceding argument. The relative path length is obtained
from the original path length by subtracting from it the length of
the focus reference path. The focus reference path has the same
starting and end points as the original central ray or boundary
ray, respectively; however, it passes through the focus point
instead of being scattered off at the target point.

According to the λ/2 argument a target point close to a chosen
focus point will not contribute significantly to the signal
received from that focus point if the relative path lengths for the
central and the boundary ray via that target point differ by half a
wavelength or more.

Figure 3 displays the rays which are relevant for estimating
whether a point T contributes to the image of an object at the
focus point F or not.


Fig. 3: Cut in longitudinal direction: central and boundary rays of
the synthetic aperture; R_0 and R_1 central and boundary receiver
antenna position, respectively; S_0 and S_1 central and boundary
source position, respectively; T target position; F focus position

Here and in the figures below we distinguish the ray types by the
convention:

  central rays                       ->  fat lines,
  boundary rays                      ->  normal lines,
  rays scattered at the target       ->  solid lines,
  reference rays passing the focus   ->  broken lines.


The distances R_0R_1 and S_0S_1 are both equal to half of the
longitudinal dimension L of the synthetic aperture. Let x denote
the distance from the target T to the chosen focus F. The
horizontal distance between the receiver array and the illuminating
horn is 2a. Following Figure 3 one identifies R_0T + TS_0 as the
path length for the central ray scattered at the target; similarly
R_0F + FS_0 is the path length for the associated reference ray.
The length of the boundary ray scattered at the target is
R_1T + TS_1, and finally R_1F + FS_1 is the path length for the
reference ray associated with the boundary ray.

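The path length difference between the signals received by the
boundary element and the central element, both focussed relative to
point F, is composed as follows:

  W(x) = \overline{R_0T} + \overline{TS_0} - \overline{R_0F} - \overline{FS_0}
         - \overline{R_1T} - \overline{TS_1} + \overline{R_1F} + \overline{FS_1}

In linear approximation for x this path length difference becomes:

  W(x) \approx \frac{xL}{\sqrt{a^2+h^2}} \cos^2\theta_B \, [ 1 + O(L^2/(a^2+h^2)) ]    (5)

where the bracket collects the non-linear corrections in the
longitudinal aperture dimension L.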
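As a plausibility check of the linearised path length difference, the following Python sketch compares the exact geometric path lengths with the leading term x L cos²θ_B / √(a²+h²). It assumes the geometry described in the text (receiver at -a, horn at +a, both at height h over the target plane, focus at the specular point, boundary position shifted by L/2). The function names and the sample values are illustrative and are not taken from the original study.

```python
import math

def two_way_path(u, x, a, h):
    """Horn -> scatterer -> receiver path for a system shifted by u.

    Receiver at (-a + u, h), horn at (a + u, h), scatterer at (x, 0).
    """
    return math.hypot(h, x + a - u) + math.hypot(h, a + u - x)

def w_exact(x, L, a, h):
    """Relative path difference between central and boundary position."""
    central = two_way_path(0.0, x, a, h) - two_way_path(0.0, 0.0, a, h)
    boundary = two_way_path(L / 2, x, a, h) - two_way_path(L / 2, 0.0, a, h)
    return central - boundary

def w_linear(x, L, a, h):
    """Leading term: x * L * cos^2(theta_B) / sqrt(a^2 + h^2)."""
    r = math.hypot(a, h)
    return x * L * (h / r) ** 2 / r

a, h = 1.0, 0.75          # assumed sample geometry in metres
ratio = w_exact(0.001, 0.1, a, h) / w_linear(0.001, 0.1, a, h)
print(f"exact / linear = {ratio:.4f}")
```

For a small target offset and a short aperture the ratio stays close to one; it drifts away as L grows, which is the effect of the neglected non-linear terms.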


The factor which multiplies the distance x in equation (5)
corresponds to the 'numerical aperture' in optics. For a rough
estimate one may neglect the non-linear corrections in the
longitudinal aperture dimension L. The contribution of the target
point to the signal received by the synthetic aperture focussed at
point F vanishes if the path length difference W is equal to half a
wavelength λ. This gives an estimate of the smallest longitudinal
scale x_min which can be resolved by the bistatic synthetic
focussed aperture system:

  x_{min} \approx \frac{\lambda}{2} \, \frac{\sqrt{a^2+h^2}}{L \cos^2\theta_B}    (6)

Except for an additional \cos^2\theta_B factor in the denominator,
this is just what is known from synthetic apertures used in radar
applications. Note the factor 1/2, which is a bonus not valid for
real apertures.

Fig. 4: Lateral direction: central and boundary rays of the
bistatic imaging system; R_0 and R_1 central and boundary receiver
array element, respectively; S position of the illuminating source.

In  the  lateral  direction  the  synthetic 
aperture  is  actually  a  real  aperture  given  by 
the  length  of  the  receiving  array  line.  Figure 
4  sketches  the  ray  geometry  as  seen  from 
above. 

The linear receiving array antenna has a length 2B. The relevant
distance scales in Figure 4 are:

  \overline{R_0R_1} = B ;  \overline{R_0F} = \overline{FS} = \sqrt{a^2+h^2} ;  \overline{FT} = y    (7)

Remember that R_1 and R_0 keep a distance h to the target plane.
The focus point F and the target point T lie in that target plane.
For the path length difference between the signals received by the
lateral boundary element and the central element, both focussed
relative to point F, one obtains:

  W(y) = \overline{R_0T} + \overline{TS} - \overline{R_0F} - \overline{FS}
         - \overline{R_1T} - \overline{TS} + \overline{R_1F} + \overline{FS}

The source position cancels exactly in the path length difference.
Up to non-linear terms in the lateral target distance from the
focus point one gets:

  W(y) \approx \frac{yB}{\sqrt{a^2+h^2}}    (8)

Reasoning analogous to the longitudinal case implies a lateral
resolution:

  y_{min} \approx \frac{\lambda}{2} \, \frac{\sqrt{a^2+h^2}}{B}    (9)

Non-linear contributions of the lateral aperture dimension B have
been ignored in this estimate.
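To get a feeling for how much the neglected non-linear contributions of B matter, the sketch below compares the exact lateral path length difference with the linear expression y B / √(a²+h²). The geometry (central element and focus in the plane of incidence, boundary element at lateral offset B) follows the text; the sample values a = 1 m, h = 0.75 m and B = 0.6 m are assumptions pieced together from other passages, and the function names are hypothetical.

```python
import math

def w_lateral_exact(y, B, a, h):
    """Central-minus-boundary relative path difference in the lateral cut.

    Central element R0 at lateral offset 0, boundary element R1 at offset B,
    both at slant distance sqrt(a^2 + h^2) from the focus F; the target T is
    displaced laterally by y from F. The source terms cancel and are omitted.
    """
    r2 = a * a + h * h
    central = math.sqrt(r2 + y * y) - math.sqrt(r2)
    boundary = math.sqrt(r2 + (B - y) ** 2) - math.sqrt(r2 + B * B)
    return central - boundary

def w_lateral_linear(y, B, a, h):
    """Linear estimate y * B / sqrt(a^2 + h^2)."""
    return y * B / math.hypot(a, h)

a, h, B = 1.0, 0.75, 0.6  # assumed sample geometry in metres
ratio = w_lateral_exact(0.01, B, a, h) / w_lateral_linear(0.01, B, a, h)
print(f"exact / linear = {ratio:.3f}")
```

With these values the exact difference comes out roughly ten percent below the linear estimate, so the resolution formula is an order-of-magnitude guide rather than a sharp bound.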


Fig. 5: Longitudinal direction: central and boundary rays in the
synthetic aperture; the illumination is by a remote spatially fixed
source. R_0 receiver antenna central position, R_1 receiver antenna
boundary position, T target point, F focus point


The longitudinal resolution changes if the illumination is provided
by a spatially fixed transmitting antenna, i.e., if the horn
antenna does not follow the forward stepping of the receiving
array. Figure 5 shows the situation in which the ground is
illuminated by a remote, spatially fixed transmitting antenna.

In this configuration the path length difference between the
signals received at the longitudinal boundary position and at the
central position, both focussed relative to point F, is:

  W(x) \approx \frac{xL}{2\sqrt{h^2+a^2}} \, \frac{h^2}{a^2+h^2} \, [ 1 + O(aL/(a^2+h^2)) ]    (10)


This leads to the following estimate for the longitudinal
resolution:

  x_{min} \approx \lambda \, \frac{\sqrt{a^2+h^2}}{L \cos^2\theta_B}    (11)

This is twice as coarse as what is obtained with the comoving
illuminating antenna.

Finally the depth of field of the bistatic imaging system has to be
determined. If the receiving antenna focusses at objects in a
chosen target plane (parallel to the ground surface) it also
displays objects which lie somewhat above or below that target
plane. The depth range for which targets are reasonably well mapped
by the bistatic imaging system (while focussing at a chosen target
depth) is the 'depth of field'.


Fig. 6: Depth of field: central and boundary rays for a synthetic
aperture; actual target T is below focus point F.

Figure 6 shows the central rays and the boundary rays for a target
point lying at a different depth than the chosen focus.

If the antenna is focussed at point F and the scattering occurs at
target T, then the path length difference becomes:

  W(z) = \overline{R_1T} + \overline{TS_1} - \overline{R_1F} - \overline{FS_1}
         - \overline{R_0T} - \overline{TS_0} + \overline{R_0F} + \overline{FS_0}

The distance FT of the actual target from the assumed target plane
is called z. Keeping only linear terms in z one finds:

  W(z) \approx h z [ 1/\sqrt{(a + L/2)^2 + h^2} + 1/\sqrt{(a - L/2)^2 + h^2}
                     - 2/\sqrt{a^2 + h^2} ]    (12)

Neglecting quartic and higher terms in the longitudinal aperture
dimension L one gets:

  W(z) \approx \frac{z L^2}{4 (a^2 + h^2)} \cos\theta_B \, [ 2 - 3\cos^2\theta_B ]    (13)

The contribution of an actual target to the assumed target plane
can be ignored if the distance of the actual target to that plane
is larger than:

  z_{min} \approx 2\lambda (h^2 + a^2) / [ L^2 \cos\theta_B (2 - 3\cos^2\theta_B) ]    (14)

This result expresses the limit of the depth of field which is due
to the longitudinal aperture dimension L. The lateral aperture
dimension B causes a slightly different bound for the depth of
field:

  z_{min} \approx 4\lambda (h^2 + a^2) / (B^2 \cos\theta_B)    (15)

Actually the formal depth of field is not particularly relevant for
a ground penetrating microwave system. In general the penetration
depth of 3.5 GHz microwaves in soil is much shorter (on the order
of 3 λ) than the formal depth of field which was estimated by eqs.
(13) and (14).
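The comparison between the formal depth of field and a penetration depth of a few wavelengths can be made concrete with a short calculation. The sketch below evaluates the two depth-of-field bounds as far as they can be read from the text, z_min ≈ 2λ(h²+a²)/[L² cosθ_B (2 - 3cos²θ_B)] and z_min ≈ 4λ(h²+a²)/(B² cosθ_B). The parameter values (a = 1 m, h = 0.75 m, θ_B = 58°, L = 0.4 m, B = 0.6 m) are assumptions collected from different passages of the paper, and the function names are hypothetical.

```python
import math

C = 299_792_458.0  # speed of light in m/s

def depth_of_field_longitudinal(lam, a, h, L, theta_b):
    """Bound due to the longitudinal aperture dimension L."""
    c = math.cos(theta_b)
    return 2 * lam * (h * h + a * a) / (L * L * c * (2 - 3 * c * c))

def depth_of_field_lateral(lam, a, h, B, theta_b):
    """Bound due to the lateral aperture dimension B."""
    return 4 * lam * (h * h + a * a) / (B * B * math.cos(theta_b))

lam = C / 3.5e9                    # wavelength at 3.5 GHz
theta_b = math.radians(58.0)       # assumed incidence angle
z_long = depth_of_field_longitudinal(lam, 1.0, 0.75, 0.4, theta_b)
z_lat = depth_of_field_lateral(lam, 1.0, 0.75, 0.6, theta_b)
print(f"z_min (L) = {z_long:.2f} m, z_min (B) = {z_lat:.2f} m, "
      f"3 lambda = {3 * lam:.2f} m")
```

Under these assumptions both bounds are of the order of a few metres, while three wavelengths amount to roughly a quarter of a metre, which illustrates why the penetration depth rather than the formal depth of field limits the view into the soil.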

Figure 7 shows the calculated intensity distribution caused by a
single pointlike target as seen by a bistatic imaging system
focussed at the target plane. The intensity is given by the modulus
squared of the amplitude in eq. (3). The pointlike target is
modelled as an isotropic scattering center in the target plane at
coordinates (x', y'), which is illuminated by a comoving
transmitting antenna at a horizontal distance 2a from the receiving
array line position x_n:

  A(m,n) = \exp\{ jk [ \sqrt{h^2 + (x_n - x')^2 + (y_m - y')^2}
                     + \sqrt{h^2 + y'^2 + (x_n + 2a - x')^2} ] \}    (16)

No approximations have been made in the evaluation of the square
roots.
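The point-target model and the focussing sum can be tried out in a few lines of Python. The following is a minimal sketch, not the simulation code used in the study: it assumes the path length √(h²+(x_n-x)²+(y_m-y)²) + √(h²+y²+(x_n+2a-x)²) for both the simulated amplitudes and the focussing phases, and the sampling (9 step positions over 0.4 m, 25 array elements over 1.2 m) is an arbitrary choice made for illustration.

```python
import cmath
import math

def two_way_path(xn, ym, x, y, h, a):
    """Path horn -> point (x, y) in the target plane -> array element.

    The element sits at (xn, ym) and the horn at (xn + 2a, 0), both at
    height h above the target plane.
    """
    to_element = math.sqrt(h * h + (xn - x) ** 2 + (ym - y) ** 2)
    to_horn = math.sqrt(h * h + y * y + (xn + 2 * a - x) ** 2)
    return to_element + to_horn

def simulate_point_target(xs, ys, target, h, a, k):
    """Element-wise amplitudes A(m, n) of an isotropic point target."""
    xt, yt = target
    return [[cmath.exp(1j * k * two_way_path(xn, ym, xt, yt, h, a))
             for xn in xs] for ym in ys]

def focus(A, xs, ys, point, h, a, k):
    """Focussed amplitude at one focus point (phase-corrected sum)."""
    x, y = point
    return sum(A[m][n] * cmath.exp(-1j * k * two_way_path(xn, ym, x, y, h, a))
               for m, ym in enumerate(ys) for n, xn in enumerate(xs))

# Assumed sampling: 9 step positions over 0.4 m, 25 elements over 1.2 m.
k = 2 * math.pi * 3.5e9 / 299_792_458.0
h, a = 0.75, 1.0
xs = [0.05 * n for n in range(9)]
ys = [-0.6 + 0.05 * m for m in range(25)]
A = simulate_point_target(xs, ys, (1.2, 0.0), h, a, k)

on_target = abs(focus(A, xs, ys, (1.2, 0.0), h, a, k)) ** 2
off_target = abs(focus(A, xs, ys, (1.2, 0.3), h, a, k)) ** 2
print(on_target, off_target)
```

Calling `focus` for every point of a grid and storing the squared modulus yields an intensity map of the kind shown in the figure; at the true target position all terms add in phase, while 0.3 m to the side most of them cancel.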

Fig. 7: Image of a pointlike target at position (1.2, 0) in the
target plane, 0.75 m below the aperture plane of a bistatic
synthetic aperture imaging system; the illuminating antenna moves
together with the receiving array.

The longitudinal resolution shown is about 0.3 m, whereas the
lateral resolution is 0.06 m. According to eqs. (6) and (9), an
aperture size of 40 cm by 120 cm allows a resolution of 48 cm by
9 cm (longitudinal by lateral) at a target plane 0.75 m below the
receiving array.
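The quoted estimates can be checked numerically. The sketch below evaluates x_min ≈ λ√(a²+h²)/(2 L cos²θ_B) and y_min ≈ λ√(a²+h²)/(2B), plus the fixed-source variant without the factor 1/2. It assumes a = 1 m (from the 2 m antenna separation), h = 0.75 m, L = 0.4 m, B = 0.6 m (half of the 120 cm array) and θ_B = 58°. Note that these values do not satisfy cot θ_B = h/a exactly; treating θ_B as an independent input is an assumption made here because it reproduces the quoted numbers.

```python
import math

C = 299_792_458.0  # speed of light in m/s

def longitudinal_resolution(lam, a, h, L, theta_b, comoving=True):
    """Comoving illumination gains a factor 1/2 over a fixed source."""
    x_min = lam * math.hypot(a, h) / (L * math.cos(theta_b) ** 2)
    return x_min / 2 if comoving else x_min

def lateral_resolution(lam, a, h, B):
    """B is half the physical length of the receiving array."""
    return lam * math.hypot(a, h) / (2 * B)

lam = C / 3.5e9                      # wavelength at 3.5 GHz, about 8.6 cm
a, h = 1.0, 0.75                     # assumed geometry in metres
theta_b = math.radians(58.0)         # assumed incidence angle
x_min = longitudinal_resolution(lam, a, h, L=0.4, theta_b=theta_b)
y_min = lateral_resolution(lam, a, h, B=0.6)
x_fixed = longitudinal_resolution(lam, a, h, 0.4, theta_b, comoving=False)
print(f"x_min = {x_min:.2f} m, y_min = {y_min:.2f} m, "
      f"fixed source: {x_fixed:.2f} m")
```

The results agree with the 48 cm by 9 cm stated above to within rounding; the simulated image is somewhat sharper than this rough estimate.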

Consider now a bistatic system where the illuminating horn remains
fixed while the receiving array is sweeping the synthetic aperture.
If the illuminating horn is not too close to the surface, the
illumination can be modelled as a portion of a plane wave incident
on the surface under the angle θ_B. One obtains an expression
similar to equation (3) for the total amplitude received by the
focussed aperture of that system. The elementwise received
amplitudes now take the form:

  A(m,n) = \exp\{ jk [ \sqrt{h^2 + (x_n - x')^2 + (y_m - y')^2} - x' \sin\theta_B ] \}    (17)

The focussing phase shift, evaluated at the focus point (x, y), is

  \exp\{ -jk [ \sqrt{h^2 + (x_n - x)^2 + (y_m - y)^2} - x \sin\theta_B ] \}    (18)

The intensity received by this system design from the same
pointlike target is displayed in Figure 8. Note that the
longitudinal resolution is now at least twice as coarse as for the
system with comoving illumination. This behaviour is predicted by
eq. (11) as compared to eq. (6).


Fig.  8:  Image  of  a  pointlike  target  at  position 
(1.2,0)  in  the  target  plane  0.75  m  below  the 
aperture  plane  of  a  bistatic  synthetic  aperture 
imaging  system:  the  illuminating  antenna  is 
spatially  fixed 


The analysis of resolution power was based solely on the array
pattern characteristics. The directional pattern of the individual
elements in the receiving antenna array was not taken into account.
Furthermore the isotropic scatterer model of the buried target is a
severe over-simplification. The bistatic scattering cross-section
of a buried target tends to concentrate the scattered field around
the reflection direction. Both directional gains would reduce the
aperture dimensions which are required in order to obtain a desired
resolution.

4.  Signal  to  Clutter  Ratio 

As important as the resolution power is the target's contrast
relative to the background for the imaging device. Since an ideally
flat ground surface would cause no reflection, the background is
produced solely by surface structures and inhomogeneities in the
soil. To get a feeling for realistic signal to clutter ratios when
the bistatic imaging system is applied to mine searching, we
display first the signals of mine-like objects buried under an
ideal overburden, e.g. homogeneous sand with a smoothly raked
surface. Next we compare this ideal case with the image produced by
a distinctively structured soil surface without any buried mines.


4.1  Scattered  Fields  of  Buried  Mines  and 
of  Soil  Surface  Structures 

In this section we display the scattered fields which are caused by
various buried targets and soil surfaces when the surface is
illuminated by an incident plane wave. The angle of incidence for
the illumination is always θ_B = 58°. The incident field is
normalized to 1 V/m, the frequency is 3.5 GHz.

The scattered fields are obtained by numerical simulation with a
well established finite difference time domain code [5].


Two  types  of  soil  are  investigated: 


•  very dry sand with dielectric constant ε' = 2.55 and specific
   conductivity κ = 0.001 S/m,

•  wet clay with dielectric constant ε' = 10 and specific
   conductivity κ = 0.1 S/m.
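To relate these soil parameters to the penetration depth mentioned earlier, one can evaluate the textbook attenuation constant of a plane wave in a homogeneous, non-magnetic lossy dielectric. This formula is not stated in the paper and the simulation code [5] may treat losses differently, so the sketch below is only an orientation under that assumption; the function name is hypothetical.

```python
import math

EPS0 = 8.8541878128e-12   # vacuum permittivity in F/m
C = 299_792_458.0         # speed of light in m/s

def penetration_depth(eps_r, sigma, freq):
    """Depth at which the field amplitude of a plane wave drops to 1/e."""
    omega = 2 * math.pi * freq
    loss_tangent = sigma / (omega * EPS0 * eps_r)
    alpha = (omega / C) * math.sqrt(eps_r / 2) * math.sqrt(
        math.sqrt(1 + loss_tangent ** 2) - 1)
    return 1 / alpha

depths = {}
for name, eps_r, sigma in [("very dry sand", 2.55, 0.001),
                           ("wet clay", 10.0, 0.1)]:
    depths[name] = penetration_depth(eps_r, sigma, 3.5e9)
    print(f"{name}: {depths[name]:.2f} m")
```

For the idealised homogeneous dry sand the attenuation is weak and the penetration depth is several metres, whereas for wet clay it shrinks to a couple of wavelengths. Real soils with inhomogeneities and moisture would be expected to lie closer to the latter.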

For both soil types inhomogeneities are not included in the model.
However, the model for the plastic mine may also be taken as a hint
of the kind of echo that is to be expected from a natural
inhomogeneity in the soil. The mine models are buried under a 10 cm
deep uniform soil layer. The scattered fields are plotted in the
plane of incidence and in a horizontal cut 0.65 m above average
ground level. The horizontal plot displays only one half of the
scattered field, which is symmetric relative to the plane of
incidence, since all targets in this study have this symmetry and
the polarisation is parallel to the plane of incidence.

The contour plots (Figs. 9 and 10 to 15) linearly map the intensity
of the scattered field into gray values. White corresponds to the
maximum intensity, black to zero intensity. The scales at the plot
axes state the longitudinal, the vertical and the lateral distances
in metres. The mines are indicated in the incidence plane plots,
close to longitudinal coordinate 0. The plots display the scattered
fields only; the incident field coming from the top right has been
filtered out. The concentration of the scattering in the reflection
direction is clearly visible. Due to the available computer
resources the modelled soil layer was limited to 0.5 m in depth.
For the dry sand this is somewhat less than what is desirable.
Therefore the displayed field distribution in the lower part of the
dry sand layer has to be taken with some care. The scattered field
in the airspace, however, is believed to be simulated reliably.


  Scattering object                          Peak value
  metallic anti-tank mine in dry sand        0.23
  big plastic mine in dry sand               0.065
  rough terrain surface in dry sand          0.35
  square metallic mine in dry sand           0.055
  metallic anti-tank mine in wet clay        0.038
  rough terrain surface in wet clay          0.35

Tab. 1: Peak values of scattered fields at 0.65 m above ground
level

Table  1  summarizes  the  peak  values  of  the 
microwave  scattering  signals  as  seen  by  a 
probe  0.65  m  above  average  ground  level 
for  the  scattering  objects  which  have  been 
examined  in  this  study. 

The modelling data of the employed mine types are as follows:

  metallic anti-tank mine:  cylinder, diameter 24 cm, height 8 cm,
                            ideally conducting,
  big dielectric mine:      cylinder, diameter 24 cm, height 8 cm,
                            non-conducting, ε' = 3.7,
  metallic square mine:     10 cm square, height 4 cm, ideally
                            conducting.

The  modelling  of  the  rough  surface  is 
represented  in  Figure  11.  The  displayed 
surface  segment  is  3  m  long. 


(b)  max=0.26  V/m 


Fig. 9: Scattered field produced by a metallic cylindrical mine,
diameter 24 cm, height 8 cm, buried in dry sand; (a) incidence
plane, (b) horizontal plane.

(a) max=1.8 V/m


(a)  max=0.76  V/m 


(b)  max=0.072  V/m 

Fig. 10: Scattered field produced by a dielectric cylindrical mine,
diameter 24 cm, height 8 cm, ε' = 3.70, buried in dry sand; (a)
incidence plane, (b) horizontal plane


Fig. 11: Computer model of a rough terrain surface with sharp
relief structures ±5 cm.

(a) max=2.8 V/m


(b)  max=0.06  V/m 

Fig.  13:  Scattering  field  produced  by  metallic 
10  cm  square  mine,  height  4  cm  buried  in  dry 
sand;  (a)  incidence  plane,  (b)  horizontal  plane 


(a) max=1.4 V/m

(b) max=0.45 V/m

Fig.  15:  Scattering  field  produced  by  a  rough 
terrain  surface  in  wet  clay;  (a)  incidence  plane, 
(b)  horizontal  plane 

4.2  Microwave  Images  of  Buried  Mines 
and  of  Soil  Surface  Structures 

The  scattered  field  as  displayed  in  the 
previous  section  is  the  input  data  for  the 
computational  image  formation  as  outlined 
in  Section  3.  Note  that  in  the  numerical 
simulation  the  illumination  was  mounted  at 
a  fixed  position  in  space  during  the 
recording  of  the  scattered  field.  The  reason 
for  this  set-up  was  purely  technical:  It  is 
much  cheaper  (i.e.  faster)  to  simulate  a 
spatially  fixed  illumination,  since  it  requires 
the  computation  of  just  one  scattered  field 
configuration.  The  original  bistatic, 
synthetic  aperture  imaging  system  uses  a 
comoving  illuminating  horn  antenna.  This 
implies  for  every  single  step  forward  a 
separate  computation  of  the  scattered  field, 
even  though  this  field  is  evaluated  only  at 
the  corresponding  position  of  the  receiving 
array  elements. 

For the signal to clutter ratio the two different methods are
expected to give the same results. As to the resolution power, it
was argued in Section 3 that the spatially fixed illumination needs
a longitudinal aperture roughly twice as large as the comoving
illumination in order to produce the same longitudinal resolution.
The images in this section are obtained with the following aperture
dimensions: 0.8 m lateral, 0.32 m longitudinal.

To evaluate the following images of targets as produced by the
studied bistatic imaging system design one has to refer to the gray
value scales given on the left side of the plots. The contrast
produced by the metallic anti-tank mine is almost 6 times stronger
than that of the big plastic mine. Comparing the gray values of the
mine models with those of the surface models one may predict a
signal to clutter ratio for the imaging of mine-like targets
achievable within the considered system design. As one learns from
a glance at Tab. 2, even for dry sand the results do not look too
promising. Of course, one cannot discuss the identification
capability solely on the basis of the global signal to clutter
ratio. Prior knowledge of the mine shapes would help to
discriminate a real mine target from clutter if the resolution
power of the imaging is good enough. Unfortunately, a resolution of
5 cm seems not sufficient to support pattern recognition methods in
mine identification.


  Target                      Signal to clutter ratio
  metallic anti-tank mine     1.26
  big plastic mine            0.22
  square metallic mine        0.21

Tab. 2: Signal to clutter ratio for targets buried in dry sand
under a surface with sharp relief structures


Fig. 16: Image produced by a metallic cylindrical mine, diameter
24 cm, height 8 cm, buried in dry sand

Fig. 17: Image produced by a dielectric cylindrical mine, diameter
24 cm, height 8 cm, buried in dry sand

Fig. 18: Image produced by a rough terrain surface in dry sand

Fig. 19: Image produced by a metallic 10 cm square mine, height
4 cm, buried in dry sand.

5.  Conclusions 

The efficiency of an electromagnetic imaging principle applied to
buried mine searching and identification is analysed without
worrying much about details of its technical realisation. The
usable frequencies for the technique lie in the S-band around
3.5 GHz. This implies that the spatial resolution cannot be much
finer than 5 cm. Numerical simulation shows that for favourable
soil conditions, e.g. dry sand with a smooth surface, big metallic
anti-tank mines and even big plastic mines will actually produce
reasonable images with the expected resolution. Pronounced sharp
relief-type surface structures, however, may blur the mine images
such that only big metallic anti-tank mines could be identified
safely under a 10 cm dry sand overburden. In the case of wet clay,
mine identification seems to become almost impossible. Attempts to
overcome the poor signal to clutter ratio by pattern recognition
techniques are hampered by the coarse spatial resolution of the
images.

1.  SUMMARY 

This  paper  will  discuss  radar  target  imaging  using  model 
based  spectral  analysis  applied  to  synthetic  aperture  radar 
(SAR)  data  or  inverse  synthetic  aperture  radar  (ISAR) 
data.  The  techniques  are  based  on  the  realization  that 
radar  target  SAR/ISAR  scattering  can  be  modeled  as 
autoregressive.  This  permits  an  autoregressive  radar 
scattering  model,  where  the  parameters  of  the  model 
permit  model-based  imaging. 

Finally,  we  will  show  a  number  of  different  example 
images  where  the  model  based  resolution  greatly  exceeds 
the  resolution  available  from  Fourier  based  techniques. 

2.  INTRODUCTION 

Wide  band  radar  scattering  from  a  target  is  often  used  to 
form  an  image  of  the  target.  The  image  is  usually  referred 
to  as  a  synthetic  aperture  radar  image  (SAR  image)  or  an 
inverse  synthetic  aperture  radar  image  (ISAR  image) 
because  the  transformation  behaves  as  if  a  very  large 
aperture  radar  antenna  is  synthesized  from  a  set  of 
incremental  data  taken  over  the  domain  of  the  (synthetic) 
aperture.  The  SAR  image  is  formed  by  moving  the  radar 
system  while  the  target  remains  stationary,  and  the  ISAR 
image  is  formed  by  holding  the  radar  at  a  fixed  location 
while  the  orientation  angle  of  the  target  changes. 

Conceptually,  the  transformation  involves  first 
transforming  the  frequency  domain  data  to  the  time 
domain  (and  thus  the  down  range  domain)  using  an 
inverse  spectral  estimation  usually  based  on  an  inverse 
Fourier  transform.  Then  the  complex  down  range  data  for 
each  down  range  value  is  transformed  to  the  Doppler 
domain  using  another  spectral  estimation,  also  usually  a 
Fourier  transformation  [48].  For  a  rotating  target,  some 
points  on  the  target  are  moving  toward  the  radar  (positive 
Doppler)  and  some  are  moving  away  (negative  Doppler). 
The  magnitude  of  the  speed  that  each  point  on  the  target 
(at  each  particular  down  range  increment)  is  moving 
relative  to  the  radar  is  proportional  to  the  cross  range 
location  of  the  point.  Thus  the  Doppler  domain  is  the 
cross  range  domain.  When  the  down  range  and  cross 
range  data  are  combined,  we  have  the  image  domain  of 
the target. Note that there are many implementations of this
concept. Some use the two independent Fourier transformations as
described (possibly in reverse order), while others use a 2-D
version of the Fourier transform. If the increments are (or can be
made) uniform and equal (often including a transformation to
Cartesian frequency space, f_x and f_y), then the fast Fourier (or
inverse Fourier) algorithm (FFT or IFFT) can be used [29]. (We will
ignore backprojection and tomographic-type transformations for
now.)

To  clarify,  a  schematic  representation  of  the 
transformation  from  a  measurement  (raw)  data  set  to  an 
image  is  shown  in  Figure  1.  The  raw  data  is  shown  in  the 
upper  left,  where  complex  radar  scattering  voltages  have 
been  measured  as  a  function  of  both  angle  and  frequency. 
If  we  take  an  inverse  (fast)  Fourier  transform  of  each 
row,  we  can  obtain  the  data  set  in  the  upper  right,  where 
each  row  (and  thus  each  aspect  angle)  is  now  a  time 
domain  (or  down  range  domain)  complex  impulse 
response.  (We  may  call  this  a  waterfall  plot  of  the 
target.)  If  we  take  a  (fast)  Fourier  transform  of  each 
column  of  this  data  set,  then  we  produce  a  cross  range 
transformation  for  each  down  range  column.  The  data  set 
in  the  lower  right  is  thus  a  down  range  versus  cross  range 
image.  Each  element  is  complex,  but  we  often  display 
this  image  by  taking  the  magnitude  of  each  image 
element  and  plotting  it  using  a  gray  scale  or  color  scale 
proportional  to  the  value  of  the  magnitude. 

Note  that  we  may  also  form  the  image  by  taking  a  (fast) 
Fourier  transform  of  each  column  of  the  raw  data  set. 

This  forms  the  Doppler  (or  cross  range)  value  for  each 
frequency  as  shown  in  the  lower  left  corner  of  the  plot. 
We  then  take  an  inverse  (fast)  Fourier  transform  of  each 
row  to  obtain  the  image  domain  data  set  in  the  lower 
right. 

Finally, note that it is possible to transform directly from
the upper left to the lower right by using a
two-dimensional (fast) Fourier transform. (The image must
be flipped left to right to correct for the need for an inverse
Fourier transform from the left side to the right side of
the figure.)
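The equivalence of the three routes through Figure 1 can be checked numerically. The following is a minimal sketch on random synthetic data, assuming NumPy's FFT conventions; the array sizes and variable names are illustrative only. With these conventions the "flip left to right" amounts to reversing the down-range index modulo the number of frequency samples, plus a scale factor.

```python
import numpy as np

rng = np.random.default_rng(0)
n_angles, n_freqs = 8, 16  # hypothetical sizes of the raw data set

# Synthetic raw data: rows index aspect angle, columns index frequency.
raw = (rng.standard_normal((n_angles, n_freqs))
       + 1j * rng.standard_normal((n_angles, n_freqs)))

# Route 1: IFFT of each row (frequency -> down range),
# then FFT of each column (angle -> cross range).
waterfall = np.fft.ifft(raw, axis=1)
image_rows_first = np.fft.fft(waterfall, axis=0)

# Route 2: FFT of each column first, then IFFT of each row.
doppler = np.fft.fft(raw, axis=0)
image_cols_first = np.fft.ifft(doppler, axis=1)

# Route 3: a single 2-D FFT, followed by an index reversal (modulo n_freqs)
# along the down-range axis and a 1/n_freqs rescaling.
full = np.fft.fft2(raw)
image_2d = np.roll(full[:, ::-1], 1, axis=1) / n_freqs
```

All three arrays agree to within rounding error, which is the numerical counterpart of the two-way arrows in the figure.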

It is important to realize that all of these transformations
are linear and thus reversible. This is emphasized in
figure 1 by the two-way arrows between domains. Many
types of image processing techniques involve image
domain gating and transformations back to the
measurement domain. The autoregressive techniques
discussed later are non-linear, however, and thus not
reversible in all cases. Although the reader should keep
these reversing concepts in mind, a discussion of these
reversible transformations and their impact on
superresolution techniques is beyond the scope of this
paper.

Paper presented at the RTO SET Lecture Series on "Advanced Pattern Recognition Techniques",
held in Bristol, UK, 14-15 September 1998; Rome, Italy, 17-18 September 1998;
Lisbon, Portugal, 21-22 September 1998, and published in RTO EN-2.

One  requirement  for  such  imaging  algorithms  is  that  there 
must  be  sufficient  information  about  the  trajectory  of  the 
radar  or  the  target  so  that  the  aspect  angle  is  well  known. 
Phase  variations  due  to  motion  other  than  simple  rotation 
(e.g.  translation)  must  be  removed  by  compensation 
based  on  the  known  trajectory  (this  is  called  motion 
compensation).  Also,  the  frequency  increment  and  the 
angle  increment  must  be  small  enough  so  that 
unambiguous  (not  aliased)  transformations  to  the  image 
domain  can  be  done. 

Finally,  the  frequency  band  and  the  angle  span  must  be 
large  enough  to  provide  range  and  cross  range  resolution 
that  is  meaningful  for  the  target  of  interest.  That  means 
that  the  range  or  cross  range  increment  must  be  small  (in 
some  sense)  with  respect  to  the  extent  of  the  target.  This 
paper  is  about  the  case  where  the  last  criterion  is  not  met. 
Either  the  frequency  band  or  the  angle  span  (or  both)  is 
too  small  to  yield  sufficient  resolution  in  the  image. 
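These sampling and resolution requirements can be put into numbers with the standard stepped-frequency relations. The following is a minimal sketch, assuming the usual textbook approximations (down-range resolution c/2B, unambiguous range window c/2Δf, and cross-range resolution λ/2Δθ for a small angle span Δθ). The function names and sample values are illustrative and are not taken from this paper.

```python
import math

C = 299_792_458.0  # speed of light in m/s

def down_range_resolution(bandwidth_hz):
    """Approximate down-range resolution for a given frequency band."""
    return C / (2.0 * bandwidth_hz)

def unambiguous_range(freq_step_hz):
    """Length of the alias-free down-range window for a given frequency increment."""
    return C / (2.0 * freq_step_hz)

def cross_range_resolution(wavelength_m, angle_span_rad):
    """Approximate cross-range resolution for a small aspect angle span."""
    return wavelength_m / (2.0 * angle_span_rad)

# Hypothetical radar: 1 GHz band in 10 MHz steps, 3 cm wavelength, 5 degree span.
dr = down_range_resolution(1.0e9)
window = unambiguous_range(10.0e6)
cr = cross_range_resolution(0.03, math.radians(5.0))
```

A target is adequately resolved only when `dr` and `cr` are small compared with its extent, and unaliased only when its extent fits inside `window`.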

There  are  a  number  of  situations  that  can  limit  the 
frequency  band.  First,  a  simple  component  limitation 
(antennas,  signal  sources,  receiver  bandwidth)  is  possible 
for  the  radar.  Alternatively,  the  propagation  medium  may 
be  limiting  the  resolution.  (Ground  penetration,  building 
penetration,  foliage  penetration  and  plasma  penetration 
radars  suffer  from  this  limitation.)  Finally,  the  radar 
target  itself  may  limit  the  useful  observational  radar  band. 
It  may  be  a  narrow  band  resonant  structure  such  as  a 
cavity,  an  active  device  or  a  periodic  structure. 

There  are  also  a  number  of  things  that  can  limit  the 
aspect  angle  span.  The  radar  platform  may  not  be  able  to 
move  in  an  optimum  trajectory  for  SAR,  or  in  the  case  of 
an  uncontrolled  ISAR  target,  the  target  simply  may  not 
move  in  such  a  way  as  to  yield  sufficient  angle  span.  In 
many  cases  for  ISAR,  the  target  is  an  aircraft  or  a  ship, 
and  it  will  be  undergoing  radial  motion,  pitch,  roll  and 
yaw  as  well  as  non-uniform  angular  rotation.  These 
uncontrolled  motions  limit  the  useful  regions  of  the  angle 
span  to  those  where  the  effects  are  small,  or  known. 
Finally,  it  is  also  important  to  observe  that  the  useful 
angle  span  may  be  limited  by  the  degradation  in  the 
Doppler  versus  cross  range  linear  relationship 
approximation  for  large  aspect  angle  domains. 

It can be seen that the basic steps in the imaging
transformations require spectral estimation. So far, we
have only mentioned Fourier transformations. Limited
angle or frequency span imaging, however, requires that
we develop techniques with spectral resolution that
exceeds that available with the Fourier transform. A large
class of model-based modern spectral estimation
techniques has recently been developed [54,55], and these
so-called superresolution transformations will be
discussed next in the context of SAR/ISAR imaging.


3.  MODEL  BASED  SPECTRAL  ESTIMATION  IN 
THE  SAR/ISAR  CONTEXT 

Superresolution SAR/ISAR imaging techniques are based
on an important radar target scattering characteristic:
in most radar scattering situations, the radar target
behaves as if it were a relatively small number of
point scatterers. This means that as the frequency is
scanned over the observation band of frequencies, each
subcomponent scatterer scatters a signal that is constant
in amplitude and has a phase associated only with its
range delay. Since the phase of the scattering from each
subcomponent is determined by the round-trip radar
range delay distance in wavelengths, a linear scan in
frequency results in a linear sweep in wavenumber
(inverse wavelength) and thus a linear phase variation as
a function of frequency. In other words, as the radar
frequency is scanned, the subcomponent signal is constant
in amplitude and varies linearly in phase: it has a complex
sinusoidal variation as a function of frequency.

As  a  function  of  aspect  angle,  each  subcomponent 
appears  to  be  moving  in  a  circle.  Over  a  “small  enough” 
sector  of  angles,  the  motion  can  be  approximated  as  a 
straight  line.  The  apparent  speed  of  each  subcomponent 
depends  on  the  distance  from  the  center  of  rotation.  Thus 
the  subcomponent  scatterers  appear  to  have  a  linear 
phase  variation  as  a  function  of  aspect  angle,  with  the 
phase  change  rate  proportional  to  cross  range  distance. 
Thus,  the  cross  range  distance  is  proportional  to  the 
(spatial)  spectral  frequency  of  the  subcomponent 
scatterer. 
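The proportionality can be made explicit with a short small-angle derivation. This is a sketch only: the symbols $x_c$ (cross range), $y_d$ (down range), $\theta$ (aspect angle) and $\lambda$ (wavelength) are introduced here for illustration, and the sign convention is chosen to match a round-trip phase of $-4\pi d/\lambda$.

```latex
\phi(\theta) = -\frac{4\pi}{\lambda}\bigl(y_d\cos\theta + x_c\sin\theta\bigr)
\;\approx\; -\frac{4\pi}{\lambda}\bigl(y_d + x_c\,\theta\bigr),
\qquad |\theta| \ll 1,
\qquad\Longrightarrow\qquad
\frac{d\phi}{d\theta} \approx -\frac{4\pi}{\lambda}\,x_c .
```

The phase is therefore linear in $\theta$ with a slope set by $x_c$, and the neglected terms (of order $\theta^2$ and higher) are what limit the usable angle span.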

The overall target appears to the radar as a relatively
small number of such subcomponent scatterers, and thus
the total received radar signal is a finite linear sum of a
relatively small number of terms, each one of which is a
complex sinusoid as a function of frequency.

We  have  thus  moved  from  the  general  situation  where 
there  is  no  restriction  on  the  behavior  of  the  scattered 
signal  to  the  situation  where  we  have  a  model  that  says 
we  have  a  relatively  small  number  of  complex  sinusoids 
as  a  function  of  frequency  and  angle.  We  can  use  this 
model  to  restrict  the  class  of  possible  solutions  for  the 
scattering  behavior  and  thus  use  a  very  powerful  set  of 
algorithms  that  take  advantage  of  the  a-priori  knowledge 
we  have  of  the  radar  scattering  behavior  as  a  function  of 
frequency  and  angle. 

We start out by modeling the scattered signal as a sum of
complex sinusoids [56,63]. Thus in the frequency
domain (at each fixed aspect angle), we have

$$x_n = \sum_{k} A_k \exp(-j 4\pi f_n d_k / c) + u_n \qquad (1)$$

where

$A_k$ is the amplitude of the $k$th sub-scatterer
$d_k$ is the down-range distance between the phase
zero reference and the $k$th sub-scatterer
$f_n$ is the $n$th frequency
$u_n$ is the noise associated with the data at the $n$th
frequency
$c$ is the speed of light
and
$x_n$ is the total complex scattering from the target at
the $n$th frequency.
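The point-scatterer model can be simulated directly. The following is a minimal sketch, assuming the round-trip phase term exp(-j 4π f d / c); the function name, the frequency grid and the scatterer position are hypothetical.

```python
import numpy as np

C = 299_792_458.0  # speed of light in m/s

def scattered_signal(freqs_hz, amplitudes, ranges_m, noise_std=0.0, seed=0):
    """Sum of point-scatterer returns versus frequency, plus optional complex noise."""
    f = np.asarray(freqs_hz, dtype=float)[:, None]
    amps = np.asarray(amplitudes, dtype=complex)[None, :]
    d = np.asarray(ranges_m, dtype=float)[None, :]
    x = np.sum(amps * np.exp(-1j * 4.0 * np.pi * f * d / C), axis=1)
    rng = np.random.default_rng(seed)
    noise = rng.standard_normal(x.shape) + 1j * rng.standard_normal(x.shape)
    return x + noise_std * noise / np.sqrt(2.0)

# One scatterer 0.6 m beyond the phase reference, 5 to 6 GHz in 10 MHz steps.
delta_f = 10.0e6
freqs = 5.0e9 + delta_f * np.arange(101)
x = scattered_signal(freqs, amplitudes=[1.0], ranges_m=[0.6])

# Sample-to-sample ratio: constant for a single noise-free scatterer.
step_ratio = x[1:] / x[:-1]
```

For a single noise-free scatterer the magnitude of `x` is constant and `step_ratio` is the same complex number at every step, which is the "complex sinusoid as a function of frequency" described in the text.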

Now we make a very important realization. We note that
any finite sum of sinusoids is autoregressive. This means
that the $n$th data point of the sequence can be accurately
predicted by the previous (or future) $N$ data points ($N$
relatively small). Thus for a sum of L complex sinusoids,
we have an autoregressive process of order L. In general,
we can write

$$x_n = -\sum_{i=1}^{L} a_i x_{n-i} + u_n \qquad (2)$$

(forward linear prediction (FLP) equation of order L)
where

$a_i$ is the $i$th prediction coefficient
$x_{n-i}$ is the $(n-i)$th term in the data series
$u_n$ is the noise (or error) in the prediction for term $n$
and
$x_n$ is the predicted value in the series.

Similarly, we also have the backward linear prediction
(BLP) equation

$$x_n^{*} = -\sum_{i=1}^{L} a_i x_{n+i}^{*} + u_n \qquad (3)$$

where * indicates the complex conjugate.

Note that the $a_i$'s are unchanged. We note that in general,
there is a similar form for the forward-backward linear
prediction (FBLP) equation. This takes advantage of the
unchanged nature of the $a_i$'s.

Once we are able to determine the coefficients of these
prediction equations, we can obtain a power spectral
estimation of the original data series by a z-transform. In
this example, where the original data is a set of scattering
measurements as a function of frequency, this represents a
transformation from the frequency domain to the time (or
down range) domain:

$$h(t) = \frac{1}{\left|\sum_{i=0}^{L} a_i \exp(j 2\pi i\,\Delta f\,t)\right|^{2}} \qquad (4)$$

where $h(t)$ is the power profile (actually the square of the
impulse response) and $\Delta f$ is the frequency increment
between data points. Note that with no loss of generality
$a_0$ can be set equal to one. Examination of this equation
shows that there is no inherent limit in the resolution of
the time domain estimate. This is due to the fact that the
model has perfect knowledge of the signal once the
parameters of the model have been estimated. Of course,
if the parameters are poorly estimated, the model has
perfect knowledge of the wrong system and thus we will
have a very high resolution estimate of the wrong value
for the spectral estimate. Never forget, we must
distinguish between resolution and accuracy!

Note that the expression in (4) is an evaluation of the
z-transform of (2) or (3) along the unit circle in the
complex plane. As a result, the locations of the poles and
zeros of (4) determine the behavior of (2) or (3).

The entire field of model based spectral estimation
depends on the process used to find the coefficients that
must be inserted in the z-transform (4). This process can
be understood for FLP by expanding equation (2) into a
matrix equation, equation (5), with $u_n = 0$.

This matrix representation of the forward linear
prediction (FLP) equation helps to show that the
equation simply represents an overdetermined set of
linear equations. There are N+1 equations in p unknowns
where the size of the measurement set is N+1 and the
number of sinusoids is p. The solutions for the a's thus
involve methods of finding efficient and effective
solutions to this set of equations.
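The matrix form of the forward prediction equation can be sketched numerically. The following is a minimal illustration, not the paper's own matrix: the row layout (one row per sample that has `order` measured predecessors) and the function names are assumptions, and the data are two noise-free scatterers spaced more closely than the 1 ns Fourier resolution of a 1 GHz band.

```python
import numpy as np

def flp_system(x, order):
    """Return Y, g with Y @ a = g encoding x[n] = -sum_i a[i] * x[n - i] (no noise term)."""
    x = np.asarray(x, dtype=complex)
    n_idx = np.arange(order, x.size)  # samples with `order` measured predecessors
    Y = np.column_stack([x[n_idx - i] for i in range(1, order + 1)])
    g = -x[n_idx]
    return Y, g

def delays_from_coefficients(a, delta_f_hz):
    """Map prediction coefficients to delays via the roots of the prediction polynomial."""
    poles = np.roots(np.concatenate(([1.0], a)))
    # Each pole equals exp(-j 2 pi delta_f tau) for data of the form exp(-j 2 pi f tau).
    return np.sort(-np.angle(poles) / (2.0 * np.pi * delta_f_hz))

delta_f = 10.0e6
freqs = 5.0e9 + delta_f * np.arange(101)
true_delays_ns = np.array([4.0, 4.5])
x = sum(np.exp(-2j * np.pi * freqs * tau * 1e-9) for tau in true_delays_ns)

Y, g = flp_system(x, order=2)
a, *_ = np.linalg.lstsq(Y, g, rcond=None)
delays_ns = delays_from_coefficients(a, delta_f) * 1e9
```

With noise-free data the two delays are recovered essentially exactly, even though they are only 0.5 ns apart. With noisy data the quality of `a` controls the accuracy of the result, which is the resolution-versus-accuracy distinction made in the text.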

Similar  representations  are  also  available  for  the 
backward  linear  prediction  (BLP)  equations  and  the 
forward-backward  linear  equations  (FBLP)  [34]. 
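A forward-backward system can be formed by stacking the two sets of equations so that one coefficient vector must satisfy both. The following is a minimal sketch, assuming the backward equations are written with conjugated data so that the same coefficients appear in both halves; the function name and the test signal are hypothetical, and the layout is not necessarily that of [34].

```python
import numpy as np

def fblp_system(x, order):
    """Stack forward and backward prediction equations into one system Y @ a = g."""
    x = np.asarray(x, dtype=complex)
    fwd = np.arange(order, x.size)      # n for the forward rows
    bwd = np.arange(0, x.size - order)  # n for the backward rows
    lags = range(1, order + 1)
    Y_fwd = np.column_stack([x[fwd - i] for i in lags])
    Y_bwd = np.column_stack([np.conj(x[bwd + i]) for i in lags])
    Y = np.vstack([Y_fwd, Y_bwd])
    g = -np.concatenate([x[fwd], np.conj(x[bwd])])
    return Y, g

n = np.arange(32)
poles = np.exp(1j * np.array([0.4, 0.9]))  # two undamped complex sinusoids
x = 1.0 * poles[0] ** n + 0.5j * poles[1] ** n

Y, g = fblp_system(x, order=2)
a, *_ = np.linalg.lstsq(Y, g, rcond=None)
```

Stacking doubles the number of equations obtained from the same measurements, which is the practical benefit of the forward-backward form when the data record is short.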

4.  MATRIX  SOLUTIONS 

Using  either  the  FLP  or  the  FBLP  approach  thus  involves 
solving  a  set  of  overdetermined  linear  equations  based  on 
a  set  of  measured  values.  There  are  two  main  methods 
used  to  solve  these  equations.  The  first  is  the  least 
squares  method  and  the  second  is  the  total  least  squares 
method  [54,55]. 

The underlying assumption of a least squares solution is
that the errors in the set of equations are embedded in the
prediction column. In fact, if the elements of the matrix
are noisy, the column space of the matrix is only an
approximation of the solution space for the equations.
(The total least squares method is used to improve on this
assumption.)

The matrix equation (5) can be written in general as

Ya  =  g.  (6) 

If  the  matrix  Y  represents  data  values  of  a  perfect 
autoregressive  (AR)  process,  the  matrix  would  have  rank 
p.  In  practice,  the  matrix  is  full  rank  because  there  is 
always  some  noise  present.  As  a  result,  all  rows  of  the 
matrix  are  linearly  independent  and  there  is  no  exact 
solution  to  the  set  of  equations. 

The  consequence  of  there  being  no  exact  solution  to  the 
set  of  equations  is  that  the  vector  g  is  not  in  the  vector 
space  that  is  spanned  by  the  columns  of  the  matrix  Y. 

The nature of the least squares solution is to consider a
vector g+r, where r is a small perturbation vector. If r is
chosen so that g+r is in the column space of Y and the
norm of r is minimized, the associated a is called the
Least Squares solution of Ya=g.

In more geometric terms, the vector g is projected onto
the column space of Y and the vector r is the
perpendicular error vector between the column space and
the g vector. The solution vector then represents the
linear combination of the column vectors in the column
space that exactly equals the projection.

It is known from linear algebra that the perpendicular
error vector r is orthogonal to every column of Y, so that
$Y^H r = 0$ (where H represents the Hermitian transpose).
Therefore, multiplying both sides of $Ya = g + r$ by $Y^H$
yields

$$Y^H Y a = Y^H g \qquad (7)$$

It follows that the least squares solution of (6) can be
represented as

$$a = (Y^H Y)^{-1} Y^H g \qquad (8)$$

The inverse of $Y^H Y$ is known to exist because Y is full
rank.

And there we have it: a Least Squares solution to the
overdetermined set of equations given in (5).
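The normal-equation solution can be checked against a library least squares solver. The following is a minimal sketch on a random complex system; the matrix sizes and variable names are illustrative only.

```python
import numpy as np

rng = np.random.default_rng(1)
Y = rng.standard_normal((20, 3)) + 1j * rng.standard_normal((20, 3))
g = rng.standard_normal(20) + 1j * rng.standard_normal(20)

YH = Y.conj().T  # Hermitian transpose

# Normal equations: solve (Y^H Y) a = Y^H g without forming the inverse explicitly.
a_normal = np.linalg.solve(YH @ Y, YH @ g)

# Reference solution from an SVD-based least squares routine.
a_lstsq, *_ = np.linalg.lstsq(Y, g, rcond=None)

# Residual r such that g + r lies in the column space of Y.
r = Y @ a_normal - g
orthogonality = YH @ r
```

The two solutions agree, and `orthogonality` is zero to rounding error, which is the geometric statement that the error vector is perpendicular to the column space. For poorly conditioned Y, the SVD-based route is usually the safer choice numerically.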

Note that the underlying assumption of a Least Squares
solution is that the errors are in the g vector. That is,
the elements of the matrix Y are perfectly known. This is
the assumption made when the g vector is projected onto
the column space of Y. In fact, if the elements of the
matrix are noisy, which they are in practice, the column
space of the matrix is only an approximation of the
solution space for the equation. The Total Least Squares
(TLS) method is then used to improve on this assumption,
especially in the case where the elements of the matrix Y
are assumed to have noise associated with them. A
detailed discussion of the TLS method is beyond the
scope of this paper, and can be found elsewhere.
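For orientation only, the classical SVD-based form of total least squares can be sketched in a few lines. This is an assumption about which TLS variant is meant, not the method of the references; the function name is hypothetical.

```python
import numpy as np

def total_least_squares(Y, g):
    """Solve Y @ a ~= g while allowing perturbations in both Y and g (classical SVD form)."""
    Y = np.asarray(Y, dtype=complex)
    g = np.asarray(g, dtype=complex)
    p = Y.shape[1]
    augmented = np.column_stack([Y, g])
    _, _, vh = np.linalg.svd(augmented)
    v = vh[-1].conj()  # right singular vector of the smallest singular value
    if np.isclose(v[p], 0.0):
        raise ValueError("TLS solution does not exist for this system")
    # The null direction is proportional to [a, -1].
    return -v[:p] / v[p]

rng = np.random.default_rng(2)
Y = rng.standard_normal((10, 3)) + 1j * rng.standard_normal((10, 3))
a_true = np.array([0.5 - 0.2j, -1.0 + 0.0j, 0.3 + 0.7j])
g = Y @ a_true
a_tls = total_least_squares(Y, g)
```

On a consistent, noise-free system TLS returns the exact solution, as ordinary least squares does. The two differ only when noise is present in Y as well as in g.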

5.  THE  BURG  METHOD 

A third method of finding the model parameters of an
FBLP model is called the Burg Method [54,56]. Burg
said  that  the  parameters  should  be  selected  in  such  a  way 
as  to  maximize  the  entropy  of  the  signal  outside  the 
measurement  window.  (In  fact,  the  method  is  often 
called  the  Maximum  Entropy  Method.)  Using  this 
approach,  and  a  modified  version  of  the  Levinson 
algorithm,  the  model  parameters  can  be  found.  The 
details  of  the  Burg  algorithm  are  found  in  the  references. 
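As an illustration, the Burg recursion in its common textbook form can be sketched as follows. It chooses each reflection coefficient to minimize the sum of the forward and backward prediction error powers and then applies a Levinson-type order update. This is a sketch under that assumption, not necessarily the exact variant of the references; the function name is hypothetical.

```python
import numpy as np

def burg(x, order):
    """Burg recursion: returns a (with a[0] = 1), reflection coefficients, error power."""
    x = np.asarray(x, dtype=complex)
    a = np.array([1.0 + 0.0j])
    ks = []
    err = np.mean(np.abs(x) ** 2)
    f = x[1:].copy()   # forward prediction errors
    b = x[:-1].copy()  # backward prediction errors, delayed by one sample
    for _ in range(order):
        den = np.sum(np.abs(f) ** 2 + np.abs(b) ** 2)
        if den == 0.0:
            break      # nothing left to predict
        k = -2.0 * np.sum(f * np.conj(b)) / den
        ks.append(k)
        a_ext = np.concatenate([a, [0.0]])
        a = a_ext + k * np.conj(a_ext[::-1])  # Levinson-type order update
        err *= 1.0 - np.abs(k) ** 2
        f, b = (f + k * b)[1:], (b + np.conj(k) * f)[:-1]
    return a, np.array(ks), err

# A single undamped complex sinusoid is predicted exactly by an order-1 model.
z = np.exp(0.7j)
x = z ** np.arange(20)
a, ks, err = burg(x, 1)
```

The coefficients follow the sign convention x[n] = -sum_i a[i] x[n-i]. Because every reflection coefficient has magnitude at most one, the resulting model is stable by construction.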

6.  APPLICATIONS  OF  SUPERRESOLUTION 
TRANSFORMATIONS 

A number of specific imaging examples will next be used
to demonstrate these techniques, moving from
one-dimensional imaging to two-dimensional imaging.

6.1  Angle  Domain  Estimation  Example 

Radar  scattering  measurements  are  often  done  in  a 
compact  RCS  measurement  range  [57].  This  system  uses 
a  large  reflector  in  the  near  field  of  the  test  area  to 
transform  a  spherical  wave  from  a  small  feed  to  a  plane 
wave  in  the  test  area.  Problems  arise  from  spurious 
scatterers  in  the  neighborhood  of  the  reflector  that 
produce  spherical  (near-field)  signals  in  the  test  zone. 
Locating  the  source  of  such  error  terms  can  be  done  by 
measuring  the  field  received  in  the  test  zone  and  then 
transforming  to  a  direction  of  arrival  map  to  trace  signals 
back  to  their  origin.  This  is  especially  difficult  in  this 
situation  because  the  error  signal  is  a  spherical  (or 
cylindrical)  wave  and  thus  if  one  attempts  to  increase  the 
resolution  of  the  direction  of  arrival  estimate  by  using  a 
larger  aperture  scan,  the  spherical  nature  of  the  signal 
causes  defocusing. 

As a study of this type of problem, a vertical scan was
taken in the Ohio State University (OSU) compact RCS
measurement range, and small increments of
displacement were used to compute the direction of
arrival [41]. One result is shown in figure 2a. Note that
the  response  due  to  the  main  beam  gives  a  large  signal 
lobe  at  zero  aspect,  but  the  smaller  error  terms  are  not 
seen.  If  we  take  larger  spans  over  the  test  zone,  the 
previously  mentioned  defocusing  also  hides  the  effect 
due  to  the  spurious  error  signal.  If  an  autoregressive 
model-based  spectral  estimation  technique  is  used  to 
compute  the  direction  of  arrival  spectrum  at  a  set  of 
vertical  increments  in  the  test  zone,  we  can  obtain  the 
plot  shown  in  figure  2b.  A  low  order  autoregressive 
model  was  used  here.  Note  that  the  spurious  signal  is  now 
visible.  The  direction  of  arrival  varies  depending  on  the 
location of the scan in the test zone. This can be made
meaningful if directional lines are plotted from each
sub-scan region in the test area back toward the compact
range main reflector as shown in figure 3. Note that the
lines converge at the junction where the parabolic
curvature of the main reflector transitions into a
cylindrical skirt. We can estimate the amplitude of the
error signal with respect to the main beam signal for each
directional line, and these are shown on the plot.

This  is  a  specific  example  where  model  based  spectral 
estimation  is  very  effective  at  extraction  of  spurious 
signals  in  an  environment  where  it  would  otherwise  not 
be  possible. 

6.2  Range  domain  estimation 

The  previous  example  showed  that  model  based  spectral 
estimation  techniques  can  be  used  to  estimate  direction  of 
arrival  by  processing  spatial  domain  data.  It  is  also 
possible  to  use  these  techniques  to  transform  from  the 
frequency  domain  to  the  time  and  thus  down-range 
domain  [28,34,53,56].  An  example  case  is  shown  in 
figure  4.  This  example  was  done  for  a  generic  missile 
shape  consisting  of  a  cylinder  with  a  frustum  (nose  cone) 
and  rounded  edge  wings  and  a  vertical  blade.  The  overall 
length was 153 cm. Data over the frequency band from 5
to 6 GHz will be shown here. Since we have only 1
GHz of radar bandwidth, our range resolution (-3 dB main
time domain response width) using Fourier transforms is
1 ns (the inverse of the bandwidth). An example down
range profile is shown in figure 4 as a dashed line. If we
use an autoregressive algorithm applied to the set of
complex frequency domain values to estimate the down
range profile, we can obtain the solid line. This line can
now show (1) the tip of the frustum, (2) the
frustum-cylinder junction, (3) the leading edge of the
wings and blade, (4) the wing and blade tips, (5) the
trailing edge of the blade, (6) the cylinder end
discontinuity and (7) the across-the-rear caustic term.
Remember that one of the most important characteristics
of these model based techniques is not the increase in
time accuracy of a single scattering term, but the ability to
distinguish between two closely spaced scatterers such as
those found at the end of this cylinder.
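Once prediction coefficients are available, the high-resolution down-range profile is obtained by evaluating the prediction polynomial on the unit circle. The following is a minimal sketch, assuming the power profile has the form 1 / |1 + Σ a_i exp(+j 2π i Δf t)|² (the sign that matches data of the form exp(-j 2π f τ)). The function name and the single-pole test coefficient, placed slightly inside the unit circle so that the peak stays finite, are hypothetical.

```python
import numpy as np

def ar_power_profile(a, delta_f_hz, t_s):
    """Evaluate the autoregressive power profile at the delays in t_s."""
    a = np.asarray(a, dtype=complex)
    t = np.atleast_1d(np.asarray(t_s, dtype=float))
    i = np.arange(1, a.size + 1)
    steering = np.exp(2j * np.pi * delta_f_hz * np.outer(t, i))
    return 1.0 / np.abs(1.0 + steering @ a) ** 2

delta_f = 10.0e6  # frequency increment of the measured data
tau = 4.0e-9      # delay of the single modeled scatterer
a = np.array([-0.9 * np.exp(-2j * np.pi * delta_f * tau)])

t = np.linspace(0.0, 10.0e-9, 101)  # 0.1 ns grid, far finer than 1/bandwidth
h = ar_power_profile(a, delta_f, t)
```

The profile can be evaluated on an arbitrarily fine delay grid, and the sharpness of each peak is set by how close the corresponding pole lies to the unit circle rather than by the measured bandwidth.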

We  can  form  a  backprojection  or  tomographic  image  of 
this  target  if  we  sum  up  the  contribution  of  each  down 
range  profile  across  the  image  domain  at  each  aspect 
angle.  For  the  Fourier  based  profiles  (equation  4),  we 
have  a  signed  (positive  and  negative  voltage 
contributions)  summation  where  small  error  terms  will 
average  out  in  regions  where  there  is  no  target.  This 
result  (only  over  180°  of  observation  domain)  is  shown  in 
figure  5a.  For  the  autoregressive  down  range  profiles,  it 
is  the  nature  of  the  spectral  estimation  that  only  power 
estimates  are  available.  This  means  that  small  error  terms 
will  add  coherently,  rather  than  integrating  to  zero.  On 
the other hand, we note that there are, in fact, no “noise”
terms as such in the down range profiles. Each profile is
deterministic (although the terms may have errors) and
thus there is no random noise type behavior in the
no-target regions of the image. Thus when the tomographic
summation over the target image domain is done (also
180° only), it is possible to obtain the image shown in
figure 5b. Note that the vertical blade, the wing tips, and
both edges of the trailing end of the cylinder can be seen
in this image with much greater resolution than in the
Fourier based image.
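The tomographic summation of power profiles can be sketched as a simple backprojection. The following is a minimal illustration, assuming non-negative power profiles on a common down-range axis and the geometry r = x sin θ + y cos θ for the down-range coordinate of a pixel. The function name, the Gaussian test profiles and the grid are hypothetical.

```python
import numpy as np

def backproject(profiles, ranges_m, angles_rad, grid_m):
    """Sum each down-range power profile across a square image grid."""
    xx, yy = np.meshgrid(grid_m, grid_m, indexing="xy")
    image = np.zeros(xx.shape)
    for profile, theta in zip(profiles, angles_rad):
        # Down-range coordinate of every pixel at this aspect angle (assumed geometry).
        r = xx * np.sin(theta) + yy * np.cos(theta)
        image += np.interp(r.ravel(), ranges_m, profile).reshape(r.shape)
    return image

# Hypothetical test case: one point scatterer observed over 180 degrees.
x0, y0 = 0.5, -0.3
ranges = np.linspace(-2.0, 2.0, 401)
angles = np.radians(np.arange(0.0, 180.0, 2.0))
profiles = [
    np.exp(-0.5 * ((ranges - (x0 * np.sin(t) + y0 * np.cos(t))) / 0.05) ** 2)
    for t in angles
]
grid = np.linspace(-1.0, 1.0, 41)
image = backproject(profiles, ranges, angles, grid)
peak_row, peak_col = np.unravel_index(np.argmax(image), image.shape)
```

Because every contribution is non-negative, errors in the profiles accumulate rather than cancel, as noted in the text. The image maximum nevertheless falls on the true scatterer position, where all profiles agree.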

6.3  Hybrid  Image  Domain  Estimation 

If  the  measurement  data  set  (such  as  is  shown  in  the 
upper  left  of  figure  1)  is  constrained  in  only  one 
dimension,  then  we  need  only  use  the  model  based 
spectral  estimation  technique  in  the  transformation  along 
the dimension that is constrained. Two examples based
on this constraint will be given below.

6.3.1  Small  aircraft  target 

Consider the example from an X-band radar imaging test
series on a small private aircraft as shown in figure 6
[39,31,28,20]. In this radar test, the radar was fixed in
location and operating with a radar wavelength of
approximately 3 cm. The radar operated from 9150 to
9898.8 MHz in 128 steps of 5.85 MHz. The small
propeller driven aircraft was flying in a small diameter
circle while several miles from the radar. Frequency
scans were taken every 25.6 ms. For this set of data, 128
such scans correspond to approximately 4.9 degrees of
azimuth change. The raw data were first phase shifted in
such a way as to remove the radial component of velocity
from the data. The result is a data set that behaves as if
the aircraft is fixed in location, but rotating about a point.
(The process is called motion compensation.) The image
shown in figure 6a is the result of a two-dimensional
Fourier transform using only 32 data points from the
angle span. The full frequency set was used (thus no
special limitation in the down range domain). The data
were cosine windowed and zero padded to 128 points in
both down range and cross range. Note the image of the
aircraft (flying away from the radar in a turn to the left).
Note also the diagonal lines near the top of the image.
These are propeller modulation effects producing false
image terms (an image alias). The image shown in figure
6b was produced by first transforming to the down-range
domain using an inverse fast Fourier transform (IFFT)
(top left to top right in figure 1). This step has the
benefit of reducing the number of scattering terms in each
subsequent transform because there is only a small
number of scattering terms at each down range location.
The upper right data matrix (shown in figure 1) was then
transformed column-by-column to the image domain by
using an FBLP spectral transform with an autoregressive
order of only 4 (i.e., it allowed at most 4 scatterers in
each down-range cut). Note the highly resolved image of
the aircraft near the center of the image. Also note the
propeller modulation terms manifested as diagonal lines
near the top of the image.
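The hybrid procedure (Fourier transform in down range, model based transform in cross range) can be sketched as follows. This is an illustration under stated assumptions, not the processing chain used for the figures. The function names are hypothetical, each column's spectrum is scaled by that column's mean power, and a small constant `eps` guards against division by zero when a pole falls exactly on an evaluation point.

```python
import numpy as np

def fblp_coefficients(x, order):
    """Least squares forward-backward linear prediction coefficients."""
    x = np.asarray(x, dtype=complex)
    fwd = np.arange(order, x.size)
    bwd = np.arange(0, x.size - order)
    lags = range(1, order + 1)
    Y = np.vstack([
        np.column_stack([x[fwd - i] for i in lags]),
        np.column_stack([np.conj(x[bwd + i]) for i in lags]),
    ])
    g = -np.concatenate([x[fwd], np.conj(x[bwd])])
    return np.linalg.lstsq(Y, g, rcond=None)[0]

def hybrid_image(raw, order, n_cross, eps=1e-12):
    """Rows of raw index aspect angle, columns index frequency."""
    waterfall = np.fft.ifft(raw, axis=1)  # frequency -> down range (Fourier)
    omega = 2.0 * np.pi * np.arange(n_cross) / n_cross
    steering = np.exp(-1j * np.outer(omega, np.arange(1, order + 1)))
    image = np.empty((n_cross, raw.shape[1]))
    for col in range(raw.shape[1]):
        column = waterfall[:, col]
        a = fblp_coefficients(column, order)  # angle -> cross range (model based)
        power = np.mean(np.abs(column) ** 2)
        image[:, col] = power / (np.abs(1.0 + steering @ a) ** 2 + eps)
    return image

# Hypothetical data: one scatterer in down-range bin 5 and cross-range bin 10 of 64.
n_angles, n_freqs, n_cross = 16, 32, 64
omega0 = 2.0 * np.pi * 10 / n_cross
raw = np.outer(np.exp(1j * omega0 * np.arange(n_angles)),
               np.exp(-2j * np.pi * 5 * np.arange(n_freqs) / n_freqs))
image = hybrid_image(raw, order=1, n_cross=n_cross)
peak = np.unravel_index(np.argmax(image), image.shape)
```

Only 16 aspect angles are used, yet the cross-range axis is evaluated on 64 points. The low model order is workable because each down-range bin contains few scatterers after the first transform.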

This entire process can be done again, but with only 8
aspect angles included. The FFT-based image for this
case is shown in figure 7a while the FBLP based image
(also order 4) is shown in figure 7b. Note that the image
of the aircraft has significantly degraded in the FFT
based image, but that the basic aircraft size and shape,
as well as the propeller modulation terms, can still be
seen in the FBLP based image.

6.3.2  Target  large  in  wavelengths 

One  of  the  criticisms  of  these  model  based  spectral 
estimation  techniques  is  that  the  number  of  spectral  terms 
must  be  relatively  small  for  the  technique  to  be  successful 
(only  relatively  low  model  order  will  work).  We  include 
an  example  of  3  cm  wavelength  imaging  of  a  122  meter 
long  ship  to  demonstrate  the  capabilities  of  the  hybrid 
image  in  this  application  [1,6]. 

In  this  exercise,  we  obtained  RCS  scattering  data  for  a 
ship  that  was  moving  in  a  1  mile  diameter  circle 
approximately  3  miles  out  to  sea.  The  radar  was  on  the 
shore.  There  was  a  limitation  in  the  ability  to  image  this 
ship  using  Fourier  techniques  because  the  pitch,  roll  and 
yaw  effects  tended  to  dominate  and  defocus  the  image  if 
data  were  collected  over  a  time  long  enough  to  permit 
significant  motion  in  those  dimensions.  On  the  other 
hand,  if  only  a  short  data  sequence  was  used,  the  aspect 
angle  span  was  too  short  for  useful  cross  range  resolution 
in  the  image.  Motion  compensation  was  also  an  issue  and 
angular  perturbations  caused  estimation  of  the  angular 
velocity  of  the  vessel  to  be  in  error. 

One  of  the  FFT  based  images  is  shown  in  the  upper  left 
plot  in  figure  8  for  this  ship.  The  down  range  transform 
used  the  full  9.10  to  9.228  GHz  band  and  thus  realized  a 
down  range  resolution  of  1.2  meters.  The  angle  domain 
span  used  to  form  the  image  was  only  from  156  to  158.7°, 
however,  and  this  limited  the  cross  range  resolution  so 
that  the  image  of  the  ship  was  quite  “smeared  out”  in 
cross  range.  The  result  of  the  hybrid  model  based  spectral 
estimation  for  cross  range  imaging  (model  order  only  7) 
is  shown  in  the  right  hand  plots  of  this  figure.  (Note  the 
scaled  drawing  of  the  ship  in  the  lower  left  hand  portion 
of  the  figure.)  With  proper  motion  compensation 
applied,  the  overall  length  and  shape  of  the  ship  can  be 
seen  in  the  image. 


6.4  Data  Extrapolation  Imaging 

One other approach to superresolution SAR/ISAR
imaging will be discussed here. Note from equations 2
and 3 that these are prediction equations (the coefficients
are called predictor coefficients) in the sense that
elements of the measurement data set (the $x_n$ values in
the equations) are predicted based on a model based linear
extrapolation. The prediction coefficients, $a_i$, are
derived based on the errors between the predicted values
and the actual measurement values. Note that once the
values of the predictor coefficients are available,
however, it is possible to make a model based
extrapolation of the data outside the measurement data
set. In the diagram of the SAR/ISAR process given in
figure 1, this means that if one of the dimensions of the
measured data set (either the frequency band or the angle
span) is not large enough to permit the image resolution
required, then the $a_i$'s can be used to extrapolate outside
the measurement space. Once this has been done, the
typical Fourier based imaging algorithm can be applied to
the expanded data set [22,29,30].

The process is thus to extend the experimental
measurement data set (the upper left matrix in figure 1)
and then form the image using Fourier based techniques.
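The extension step can be sketched in one dimension. The following is a minimal illustration, assuming prediction coefficients in the convention x[n] = -sum_i a[i] x[n-i] for the forward direction and the conjugated coefficients for the backward direction. The function name and the test signal are hypothetical.

```python
import numpy as np

def extrapolate(x, a, n_extra):
    """Extend x by n_extra samples on both sides using linear prediction."""
    x = np.asarray(x, dtype=complex)
    a = np.asarray(a, dtype=complex)
    lags = np.arange(1, a.size + 1)
    out = np.concatenate([np.zeros(n_extra, dtype=complex), x,
                          np.zeros(n_extra, dtype=complex)])
    first, last = n_extra, n_extra + x.size - 1
    for n in range(last + 1, out.size):  # forward prediction
        out[n] = -np.dot(a, out[n - lags])
    for n in range(first - 1, -1, -1):   # backward prediction
        out[n] = -np.dot(np.conj(a), out[n + lags])
    return out

# Two undamped sinusoids: the measured samples are 20..39 of a 60 sample record.
n = np.arange(60)
poles = np.exp(1j * np.array([0.4, 0.9]))
x_full = 1.0 * poles[0] ** n + 0.5j * poles[1] ** n
a = np.poly(poles)[1:]  # exact prediction coefficients for this signal
extended = extrapolate(x_full[20:40], a, n_extra=20)
```

The measured samples are kept unchanged in the middle of `extended` and only the outer samples are predicted. A window such as `np.hamming(extended.size)` could then be applied before the Fourier transform. With exact coefficients the extension reproduces the true signal; with coefficients estimated from noisy data the error grows with distance from the measured span.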

One limitation of this technique is that there may be
measurement errors in the original data set that cause
inaccuracies in the $a_i$'s. This will lead to degradation of
the predicted values as the extension distance increases.

Also,  this  is  a  model  based  technique,  and  if  the  physical 
situation  is  such  that  the  data  do  not  fit  the  model 
(especially  outside  the  measurement  domain),  then  the 
prediction  will  be  inaccurate  as  the  domain  of  the 
prediction  increases.  As  an  example,  note  that  the  phase 
progression  (the  distance  between  the  radar  and  the  target 
subcomponent)  does  not  change  linearly  as  a  function  of 
aspect  angle.  This  means  that  the  cross  range  behavior  of 
the  model  based  extrapolation  will  not  be  effective  in 
predicting  the  values  beyond  the  measurement  angle 
span.  To  avoid  this  problem,  one  can  “polar  format”  the 
scattered  field  data  and  can  then  extrapolate  the  polar 
formatted  data  along  the  two  Cartesian  components  of 
frequency  that  result  (see  [28,29,34]).  (Note  that  this 
property  is  not  always  a  disadvantage.  We  are  attempting, 
after  all,  to  increase  the  spectral  resolution  in  the 
observed  data  set  only  in  most  cases.) 

An example where this has been done is given in figure
9 (taken from [29]). The example data set is an air-to-air
VHF/UHF ISAR measurement set on a KC135 jet [39].
The  frequency  band  was  originally  measured  from  160  to 
360  MHz  and  the  angle  span  from  -20  to  20°.  For  the 
analysis  shown  here,  we  used  only  the  frequency  band 
from  240  to  280  MHz  and  the  angle  data  only  from  -2  to 
2°.  Figure  9a  shows  the  FFT  based  ISAR  image. 

Next,  FBLP  autoregressive  techniques  are  used  to 
compute  the  prediction  coefficients  and  extend  the 
available  frequency  band  and  angle  span  by  a  factor  of 
approximately  3.  In  this  case,  a  2-D  FBLP  algorithm  of 
order  5  was  used  [34]  to  obtain  the  prediction 
coefficients. The algorithm used the full 2-D
autoregressive nature of the raw ISAR data in Cartesian
frequency space ($f_x$ and $f_y$) [29].

Although there is no limit in principle to the extrapolation
extent, the studies reported here suggest that extending the
domain by a factor of three is a reasonable limit. Note that the
data can be predicted over the entire measurement domain. In the
method presented here, however, only the data outside the
measurement domain are derived from the prediction coefficients;
the measured data are retained where available. The total span of
(predicted and measured) data is then windowed (e.g., Hamming).
The windowing reduces the transform domain sidelobes while
de-emphasizing the energy of the extrapolated data. Finally,
Fourier transforms are used to transform to the image domain.
Note that one may choose to extrapolate the frequency band and/or
the angle span (although this extrapolation is actually applied
in 2-D to the Cartesian frequency components derived from the
polar format transformation).
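
The processing chain described above (estimate prediction coefficients, extrapolate outside the measured span, retain the measured samples, window, Fourier transform) can be sketched in one dimension as follows. This is only an illustrative sketch, not the 2-D algorithm of [29,34]: it assumes a single complex data vector, a least-squares forward-backward linear prediction fit, and symmetric extension on both sides. The function names `fblp_coefficients`, `extrapolate` and `range_profile` are hypothetical.

```python
import numpy as np


def fblp_coefficients(x, order):
    """Least-squares forward-backward linear prediction coefficients a[0..order-1].

    Forward model:  x[k]       = sum_i a[i] * x[k-1-i]
    Backward model: conj(x[k]) = sum_i a[i] * conj(x[k+1+i])
    """
    x = np.asarray(x, dtype=complex)
    n = len(x)
    rows, rhs = [], []
    for k in range(order, n):                 # forward prediction equations
        rows.append(x[k - order:k][::-1])
        rhs.append(x[k])
    for k in range(0, n - order):             # backward prediction equations
        rows.append(np.conj(x[k + 1:k + order + 1]))
        rhs.append(np.conj(x[k]))
    # rcond truncates numerically zero singular values (minimum-norm solution).
    a, *_ = np.linalg.lstsq(np.array(rows), np.array(rhs), rcond=1e-10)
    return a


def extrapolate(x, order, factor):
    """Extend x symmetrically to `factor` times its length; measured data are kept."""
    x = np.asarray(x, dtype=complex)
    n = len(x)
    pad = (factor - 1) * n // 2
    a = fblp_coefficients(x, order)
    y = np.concatenate([np.zeros(pad, complex), x, np.zeros(pad, complex)])
    for k in range(pad + n, len(y)):          # predict forward past the data
        y[k] = np.dot(a, y[k - order:k][::-1])
    for k in range(pad - 1, -1, -1):          # predict backward before the data
        y[k] = np.dot(np.conj(a), y[k + 1:k + order + 1])
    return y


def range_profile(y):
    """Hamming-window the total (predicted and measured) span, then transform."""
    return np.fft.fftshift(np.fft.fft(y * np.hamming(len(y))))
```

For example, `range_profile(extrapolate(measured, order=5, factor=3))` would correspond to the order and extension factor quoted in the text, applied here to a 1-D vector `measured` rather than to polar-formatted 2-D data.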

The  resulting  image  of  the  KC135  aircraft  is  shown  in 
figure  9b  (taken  from  [29]).  Note  the  tail  and  wing  tips 
as  well  as  the  four  jet  engine  outlets  in  the  image. 
(Remember  that  the  data  were  collected  from  the  rear.) 
Clearly,  the  resolution  of  the  image  is  much  higher  than 
that  of  the  image  in  figure  9a. 


7.  CONCLUSION 

This  paper  has  discussed  radar  target  imaging  techniques 
(SAR  and  ISAR)  based  on  model  based  autoregressive 
spectral  estimation.  We  have  discussed  the  basics  of 
autoregressive  spectral  estimation  and  have  given 
examples  of  the  resulting  images  as  compared  to  Fourier 
based  imaging  techniques. 

We  began  with  spatial  domain  extrapolation,  then 
showed  a  tomographic  image  technique  where  the  image 
was  formed  from  the  high  resolution  down  range  power 
profile.  Using  an  outline  of  general  SAR/ISAR  concepts 
based on figure 1, we then showed a hybrid algorithm,
where  only  one  of  the  image  domains  was  extended,  and 
a  full  2-D  FBLP  autoregressive  technique  where  both 
domains  (in  Cartesian  frequency  space)  were  extended. 

In  all  cases,  it  was  demonstrated  that  with  model  based 
spectral  estimation  it  is  possible  to  obtain  resolution  in 
the  SAR/ISAR  image  well  in  excess  of  that  available 
from  Fourier  techniques. 
Figure 1. Schematic diagram of the SAR/ISAR algorithm.

Figure 3. Direction of arrival and relative (to main beam) amplitude of secondary signal for a set of sub-apertures from
figure 2.b (from [41]).

Figure 4. Time domain amplitude response for generic missile shape over the band 5 to 6 GHz (dashed line = Fourier
results; solid line = autoregressive spectral estimation).

Figure 8. ISAR image of naval ship. 9.100 to 9.228 GHz; 158.6 to 158.7 angular degree span. (a) Hanning windowed
2-D IFFT; (b) Modified hybrid technique of order 7 (in cross range) without motion compensation; (c) Modified hybrid
technique of order 7 (in cross range) with linear motion compensation. Note scaled ship image in lower left (from [1]).

Figure 9. ISAR image of KC-135 aircraft (240 to 280 MHz; -2 to 2 degrees off rear aspect). (a) 2-D FFT (panel title:
"Original data 0.24 to 0.28 GHz, -2 to 2 degrees; no extrapolation"); (b) Spectral and angular extrapolation by factor
of 3 using order 5 (from [28]).

1. SUMMARY

A difficult problem concerns the identification of
buried mines and unexploded ordnance (UXO).
This paper reviews the use of the singularity
expansion method and symmetry to identify such
targets and distinguish them from other buried
objects of comparable size.

2.  INTRODUCTION 

In trying to locate and identify UXO and mines in
soil, one is faced with a formidable problem. One
needs to discriminate these from rocks, tree roots,
shrapnel, etc.; otherwise one will have a large
false-alarm rate. Here we emphasize electromagnetic
techniques utilizing target signatures. (See C.
E. Baum, Signature Based Target Recognition, in
this volume.) For small targets in soil, adequate
resolution in imaging can be very difficult to achieve
due to high-frequency attenuation and scattering in
the medium. For large targets (such as buried waste
dumps) imaging may be more appropriate, but this
is not the case considered here.

An  important  target  signature  in  this  case  is  the 
natural  frequencies  in  the  singularity  expansion 
method  (SEM).  This  is  manifest  in  more  than  one 
form. 

3.  ELECTROMAGNETIC  SINGULARITY 
IDENTIFICATION  (EMSI) 

In this case, a ground penetrating radar (GPR)
illuminates the target with a pulse (or many
frequencies) with significant frequencies in the
general range of 100 MHz to 1 GHz so that
wavelengths in the external medium are of the order of
the target dimensions, thereby exciting the
resonances. The target natural frequencies are
significantly changed from their values in free space by
the properties of the surrounding medium (soil,
water). For metal targets there are scaling
relationships for the natural frequencies, modes, and
coupling coefficients based on the constitutive
parameters of the external medium. For dielectric
targets one can use the internal resonances with a
perturbation formula if the wave impedance of the
exterior medium is lower than that of the dielectric
target.
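
As an illustration of such a scaling relationship, the sketch below maps a free-space natural frequency of a metal target to its value in a surrounding medium. It is not taken from the paper: it assumes a perfectly conducting target in a homogeneous, isotropic medium with frequency-independent relative permittivity `eps_r` and conductivity `sigma`, and free-space permeability. Under these assumptions the natural frequencies occur at fixed values of the propagation constant, so that s^2 eps_r + s sigma/eps_0 = s_free^2. The function name `scaled_natural_frequency` is hypothetical.

```python
import cmath

EPS0 = 8.8541878128e-12  # permittivity of free space in F/m


def scaled_natural_frequency(s_free, eps_r, sigma):
    """Map a free-space natural frequency s_free (1/s, complex) into a lossy medium.

    Assumption (not from the source): perfectly conducting target, medium with
    constant eps_r and sigma, permeability of free space. Solves
        s**2 * eps_r + s * sigma / EPS0 = s_free**2
    and returns the root that reduces to s_free / sqrt(eps_r) when sigma = 0.
    """
    s_free = complex(s_free)
    half_damping = sigma / (2.0 * eps_r * EPS0)
    root = cmath.sqrt(half_damping ** 2 + s_free ** 2 / eps_r)
    # Pick the branch of the square root that points along s_free.
    if (root * s_free.conjugate()).real < 0:
        root = -root
    return -half_damping + root
```

With `sigma = 0` the natural frequency is simply divided by the square root of `eps_r`; a nonzero conductivity additionally shifts it further into the left half of the s plane.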

4. MAGNETIC SINGULARITY
IDENTIFICATION (MSI)

For highly, but not perfectly, conducting targets
(metal) there is a very promising technique based
on a set of natural frequencies corresponding to
diffusion in the target. The frequencies are quite
low (kHz range) so that wavelengths in the
external medium are large compared to distances to the
exciting and measuring antennas (typically coils in
the near field). Provided the soil is nonmagnetic
(permeability approximately that of free space), the
natural frequencies are independent of the external
medium, a significant advantage. These natural
frequencies are on the negative real axis of the s
plane (Laplace-transform variable or complex
frequency) and correspond to decaying exponentials
in time domain. Together with appropriate unit
vectors these natural frequencies can be used to
characterize the magnetic polarizability dyadic,
and thereby the target.
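
The link between poles on the negative real axis and decaying exponentials can be written out as a short worked relation. The notation is introduced here for illustration only and is not the paper's: it assumes simple (first-order) poles $s_\alpha < 0$ with real amplitudes $a_\alpha$, and $u(t)$ denotes the unit step.

$$\frac{a_\alpha}{s - s_\alpha} \;\longleftrightarrow\; a_\alpha\, e^{s_\alpha t}\, u(t), \qquad s_\alpha < 0,$$

so that a response built from such poles has the form

$$\sum_\alpha a_\alpha\, e^{-t/\tau_\alpha}\, u(t), \qquad \tau_\alpha = -\frac{1}{s_\alpha} > 0.$$

Under these assumptions each natural frequency corresponds to one decay time constant $\tau_\alpha$, with no oscillation.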

Besides  buried  targets,  this  technique  is  being 
applied  to  security  systems  such  as  airport  metal 
detectors. 

5.  ACOUSTIC  SINGULARITY 
IDENTIFICATION  (ASI) 

Acoustic/elastodynamic scattering also has a
singularity expansion representation with similar
properties. However, in a medium such as water
which only supports p (pressure) waves with no
significant s (shear) waves, there is a lack of
polarization information, a disadvantage. Its great
advantage applies in cases of external media with a
conductivity large enough to severely attenuate
electromagnetic waves at frequencies of interest.

6.  SYMMETRY  IDENTIFICATION 
In conjunction with the above techniques, one can
use target symmetry as an identifier, both for EMSI
and MSI. In the case of EMSI a target with the
symmetry of a body of revolution with axial
symmetry planes, and with the rotation axis
perpendicular to the ground surface, has no hv scattering
(cross polarization) in the usual h,v radar
coordinates. This has been observed with synthetic
aperture radar (SAR).

7.  CONCLUDING  REMARKS 
The various techniques discussed above each have
advantages and limitations. As such, they are
complementary. For the three kinds of natural
frequencies one can look at these in a diagrammatic
way as in fig. 1. This has the properties of a matrix
and a Venn diagram (as in Boolean algebra). Note
in the bibliography the various "Notes" are
published by AFRL/DEHP, C. E. Baum, editor.


Fig. 1. SEM-Based Buried-Target Identification. (Recoverable diagram labels: EMSI, ASI.)
1. SUMMARY

As part of the TNO-FEL Ultra-Wide-Band Ground-Penetrating-Radar
(UWB-GPR) project, a test facility for controlled GPR experiments
was planned. Construction of this sand-box test facility has
recently been completed. At the same site another test facility,
for evaluating various commercial off the shelf (COTS) sensors
for the Dutch humanitarian demining program HOM2000, has been
realised. This article describes the test facilities themselves
as well as the framework in which they were realised.

2.  INTRODUCTION 

In recent years the research into mine-detection systems at the
TNO Physics and Electronics Laboratory (FEL) has increased
substantially. Apart from establishing a firm knowledge base
and experience through ambitious projects such as UWB-GPR,
TNO-FEL has also been actively involved in the Dutch Ministry
of Defence's humanitarian demining project, HOM2000. Both the
TNO-FEL demining/mine-detection focus program, of which
UWB-GPR is a key component, and the involvement in the HOM2000
humanitarian program have created the need for good testing and
experimenting facilities. Over the past year, two rather unique
facilities have been developed and completed. The first is a
facility specifically for testing UWB-GPR systems and for data
acquisition. The second is a multi-sensor mine-detection test
facility for evaluating systems under controlled but realistic
conditions.

3.  TESTING  FACILITY  FOR  CONTROLLED 
UWB-GPR  EXPERIMENTS 

In this section we describe the development of the first test
facility, which has been constructed at TNO-FEL for carrying out
controlled experiments with a ground-penetrating radar system.
This test facility has been developed in cooperation with the
Delft University of Technology, where a parallel project in the
area of ground-penetrating radar is being carried out. One of the
principal goals was to perform the necessary analytical and
design work to support the construction of an experimental
facility where target objects can be buried in a sand medium and
illuminated by an ultra-wideband electromagnetic field. The sand
medium should be a realistic, yet controlled environment in which
objects of interest can be buried. The electromagnetic field
scattered by the buried object has to be observed and diagnosed
in order to develop a computer database containing the responses
of a large number of objects (e.g. landmines).


3.1  COOPERATION  WITH  DELFT  UNIVERSITY 
OF  TECHNOLOGY 

The Delft University of Technology is engaged in a
ground-penetrating radar project called "Improved
ground-penetrating radar technology". This project started in
January 1996 and will end in December 1999. It is supported by
the Dutch Technology Foundation STW (Stichting voor de Technische
Wetenschappen) under contract number DMB55.3649. The objective of
the project is the development of a new electromagnetic technique
for the geological characterisation of the shallow subsurface of
the earth. In this particular STW project four different parts
are distinguished.

• The first part of the project comprises the design and
realization of a transportable ground-penetrating radar
system. This subproject concentrates on the development
of antennas in a frequency band that ranges from 200 MHz
up to 1 GHz. The system consists of a transmitting and
receiving device, in which time-domain signals (nanosecond
pulses) are generated, received and processed.

• The second part of the project comprises the construction of
a full-size testing site in order to perform controlled
experiments with the ground-penetrating radar system.

• The third part of the project comprises the development of
a fast and accurate electromagnetic modeling method. This
modeling method provides the basis for the electromagnetic
imaging and furthermore provides a means to check
the performance of a field measurement over a known
geological situation. Adequate measurements at the testing
site should provide the input data.

• The final part of the project comprises the development of
practical strategies and processing algorithms with the
final goal of realizing the actual image. Not only the
test site will be used; more realistic geological sites
will also be investigated, and the results compared to
existing ground-penetrating tools and conventional techniques.

The development of the new ground-penetrating radar system
at Delft University of Technology will take place in a
cooperation between the International Research Center for
Telecommunications Transmission and Radar (IRCTR) and the
research school Centre for Technical Geoscience (CTG).
IRCTR is hosted by the Faculty of Electrical Engineering and
is directed by Prof. Dr. L.P. Ligthart. CTG is a cooperation
between the Laboratory of Electromagnetic Research of the
Faculty of Electrical Engineering (directed by Prof. Dr. P.M.
van den Berg) and the section Technical Geophysics of the
Faculty of Mining and Petroleum Engineering (directed by
Prof. Dr. J.T. Fokkema). Prof. Dr. J.T. Fokkema is also the
project leader of the STW project and as such the point of
contact for TNO-FEL.

The third part of the STW project (the development of an
electromagnetic modeling method) is partly sponsored by
TNO-FEL. TNO-FEL has awarded a research grant for a postdoc
to participate in the project for one year. The second part
of the STW project, which is the construction of the test
facility, is performed in cooperation with the Electromagnetics
Section of TNO-FEL, since such a testing facility was also
needed for another TNO-FEL project, namely UWB-GPR.
The test facility was built on the premises of TNO-FEL
(completed July 1997) and will be used for both projects. It is
clear that a cooperative effort in the development of such a
testing facility has many advantages, of which sharing the
financial burden is only one.

The project at Delft University of Technology sponsored by
STW and the UWB-GPR project at TNO-FEL sponsored by
the Netherlands Army are complementary; knowledge and
experience gained in the projects can be exchanged, as can,
for example, measurement data. The projects differ in
signal processing: the STW project aims at detection only;
characterization or classification is postponed until the
imaging step can be satisfactorily performed. TNO-FEL aims
at the exploitation of the singularity expansion method to
perform detection and classification. Furthermore, since a
university is not a defence research institution, the goal of
its ground-penetrating radar system is directed towards civil
applications, like engineering activities and environmental and
archeological investigations, and not primarily landmines.
Examples are the detection of cables and pipes, the
determination of the stability of dikes, and the detection of
leakage and of the spreading of (in)organic pollution.


3.2  CONSTRUCTION  OF  THE  TESTING  FACILITY 

Within the framework of the cooperation with the Delft
University of Technology, a full size testing facility has been
designed and constructed on the premises of TNO-FEL in
order to perform controlled radar experiments. The testing
facility was completed in July 1997. It is located in the
dunes directly behind the TNO-FEL building, on a location
that was formerly known as the "EMP site", where
electromagnetic pulse experiments were once carried out.

The test facility consists of a buried wooden box. The box
measures 10 m x 10 m in plan and is 3 m deep (300 cubic
meters). The box has been fully dug into the ground, so the
bottom of the box is located three meters below ground level.
Special care has been taken not to use any metal parts in the
construction of the box or in its vicinity. The sandbox is
filled with clean (homogeneous) river sand from the (Dutch)
Caland canal. In order to keep the condition of the sand in the
box optimal and to prevent contaminants from the outside (for
example ground water) from entering the box, a drainage system
was installed and the inside of the box was covered with a
watertight plastic lining. To prevent the weather from
influencing the test conditions and to protect the measuring
equipment, a large tent covers the entire site. Additionally, a
wooden shed is available for storage of personal computers and
measurement equipment. While filling the box with sand, special
care was taken to get a homogeneous profile. Current tests
indicate that this was not entirely successful, but the profile
should be good enough to work with. Later on in this study we
will probably empty the box and refill it with a different
ground composition. To facilitate the measurement of EM
transmissions into the ground, a square PVC tube running from
the surface of one side to the bottom of the other side has been
installed about one meter from the edge of the sandbox.

The following small photo gallery shows the progress in the
construction of the UWB-GPR test facility.


Figure 3.2 Start of the activities. Dismantling of the EMP site. The location is being prepared for construction activities. In
the background the (green) wooden storage shed, where the equipment is kept, is visible.

Figure 3.3 Start of the construction of the wooden box. First, some wooden piles are driven into the ground. The piles are
used as a framework to strengthen the construction of the walls.

Figure 3.4 Construction of the walls of the box using wooden beams. The wood that was used to construct the box was
tropical hardwood from Central Africa.

Figure 3.5 While the walls of the box were being constructed, the soil at the location was removed simultaneously.

Figure 3.6 Last part of the digging phase. The wooden box is almost finished.

Installation of the drainage system in the wooden box.

Figure 3.9 Overview photo. The wooden box is now ready to be filled with sand.

Figure 3.11 The wooden box is filled with clean (homogeneous) sand.

Figure 3.12 Installation of a square PVC tube running from the surface of one side to the bottom of the other side of the box,
facilitating measurements of EM transmissions into the ground.

4. HOM2000 TEST FACILITY

The other test facility on the TNO test site "Waalsdorp" is the
HOM2000 test facility. The main purpose of this facility is the
testing of commercial off the shelf (COTS) sensors that can be
used for, and/or improved for, mine detection. Currently four
sensor types are being evaluated, among them metal detectors,
thermal infrared imaging detectors, and ground penetrating
radars. For each sensor type, four systems are being tested.

4.1 CONSTRUCTION

The HOM2000 test facility consists of six test lanes. Each lane
has a length of 10 m, a width of 3 m and a depth of 1.5 m. Every
lane contains a different kind of soil representative of soil
found in actual minefield locations. The kinds of soil used are
sand, clay, peat, soil with a high iron content, forest soil, and
rocky soil.

The test facility was constructed in such a way as to minimize
external influences. No metal components are allowed inside a
zone of 5 m around the whole test rig. Of course the test lanes
themselves were also constructed without the use of any metal
whatsoever. The facility was constructed along a north-south
axis so that infrared sensors would not have to take
measurements in their own shadow. A groundwater level management
system was also installed to be able to keep the groundwater
table at a constant level. To facilitate accurate positioning of
the sensors, a non-metal measurement platform was designed
and installed.

4.2  MEASUREMENT  PLATFORM 

The measurement platform is basically a large, 10 m long tubular
PVC/glass-fibre bridge. Wheels on either end of the bridge run
on rails located at the head and foot of the test lanes. In this
way the whole bridge can be moved over any one of the test lanes.
Once in position the bridge can be locked into place by a nylon
bolt which fits in one of a series of holes spaced every 0.05 m.
The sensor is mounted on a small belt-driven trolley which runs
on tracks over the full length of the bridge. The trolley has a
small platform with mechanical adaptors to facilitate the
mounting of the different sensors. At the end of the bridge,
just outside the 5 m metal-free zone, an electric motor is
connected to the belt pulling the trolley along the bridge. The
maximum speed of the trolley is about 10 m/minute. Braking is
gentle, but an emergency brake will stop the trolley abruptly if
needed.

To record the position of the sensor accurately, a laser
distance meter is mounted on the far side of the bridge. The
laser beam is reflected off a reflector mounted on the trolley,
and an accurate position reading and time-stamp are registered
many times per second. This way an accurate record of the
position of the sensor is available for every measurement.
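
One way such a time-stamped position log could be used is to interpolate the trolley position at the instants at which a sensor took its samples. The sketch below is an assumption about the data handling, not a description of the HOM2000 software: it assumes both the laser log and the sensor samples carry time-stamps on a common clock, and the function name `sensor_positions` is hypothetical.

```python
import numpy as np


def sensor_positions(laser_times, laser_positions, sample_times):
    """Linearly interpolate logged trolley positions (m) at the sensor sample times (s).

    laser_times must be increasing; all time-stamps are assumed to share one clock.
    """
    return np.interp(sample_times, laser_times, laser_positions)
```

Linear interpolation is a reasonable choice only if the trolley speed changes slowly between consecutive laser readings.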

4.3  SURROGATE  LAND-MINES 

Inside every test lane a number of surrogate landmines and
potential false alarms are buried at different locations and
depths. These landmines are representative of what is typical
in a real minefield. The emphasis is mainly on anti-personnel
mines, although some anti-tank mines are also buried (and of
course a couple of false targets were also put in place).
Control of the whole test site and test lanes is quite strict so
as not to disturb the measurement conditions. Security is a
further important reason to impose strict access control.

Once buried, the mines will not be dug up again. In fact a
certain time (months!) has to be allowed to let the mines settle
in the ground, because it takes quite a while before the ground
disturbance created when burying the mines has diminished.

To allow for some experimenting, a small area of each test lane
was set aside. This area can be used to bury sensors or
reference objects. To be able to determine the characteristics
of the sensor with respect to "virgin" soil, each test lane
also contains a small area devoid of any surrogate landmines
or false targets.

The surrogate landmines were constructed in such a way as to
optimise their resemblance to actual live mines. The
resemblance to real mines is not just visual; care was also
taken to use materials whose electromagnetic and thermal
properties closely match those of actual landmines. Details of
the construction are not included here but could, under certain
conditions (constraints), be made available to others.

The following pictures show the construction of the measurement
platform. In the picture you can see the antenna array
and tick-wheel of a commercial GPR system which is mounted
on the platform. For actual measurements the tick-wheel
interface will be connected to the output of the laser position
indicator.


A  view  of  the  HOM2000  measurement  platform.  In  the  background  the  tent  covering  the  UWB-GPR  test  facility 
can  be  seen.  In  the  distance  one  notices  the  TNO-FEL  radar  tower. 




Practical  Application  of  Pattern  Recognition  Techniques 

Hendrik  Rothe 
Arno  von  der  Fecht 
Andre  Kasper 
Thomas  Kinder 

e-mail:  rothe@unibw-hamburg.de 
Phone:  040/6541-2723  Fax:  040/6541-2743 
Mess-  und  Informationstechnik,  FB-MB,  Universitaet  der  Bundeswehr 
Holstenhofweg  85,  D-22043  Hamburg 


1  Introduction 

This paper deals with three examples of pattern
recognition and an estimation of the computational
effort needed for the real time parts
(prediction) of the following approaches:

• Discriminant Analysis

• Feedforward Neural Network

• Sugeno Fuzzy Classifier.

The first example is of the simple pattern matching
type. A satellite photograph of our university
from a freely distributed CD-ROM was used. We
extracted the clock tower of the building where
the chair for measurement and information technology
is located and tried to find that place
by correlation analysis. This pattern recognition
works without features. We found the location
of the clock tower with an accuracy of about ±1
pixel (±1 meter) in a photograph which actually
displays 2 x 2 kilometers. The interesting
point here, however, is that one can use freely
available data and a simple PC platform with common
numerical software like MATLAB.

In the second example we show our work in the
field of in-situ inspection of high performance optics
(e.g. degradation of airborne surveillance systems)
and supersmooth surfaces, like wafers, hard
discs and flat panel displays.

In this case the extraction of the correct feature
variables and the choice of appropriate classifiers
are of great importance for the performance of the
pattern recognition system.

It  is  shown  that  empirical  statistical  moments  and 
Discriminant  Analysis  perform  best. 


The third example of the paper is based on a
range gated 3D laser radar system without scanner
developed in our group. The design of the
system is explained and measurement results are
presented. This part of the paper explains why
one can use very simple features and straightforward
classifiers if the hardware of the pattern
recognition system is specially designed for
classification purposes. To our knowledge it is the
first time that such an approach has been published
in unclassified form.


2 Pattern Matching in Satellite Photographs

2.1  List  of  Symbols 


Symbol                 Description
$A, U$                 matrices
$G$                    set
$R_{xx}$               autocorrelation function
$E\{\ldots\}$          ensemble expectation operator
$P_{xx}$               Fourier transform of $R_{xx}$
$X(f)$                 Fourier transform of $x$
$(\ldots)^{*}$         complex conjugate
$F\{x\}$               Fourier transform of $x$
$F^{-1}\{x\}$          inverse Fourier transform of $x$
$R_{xy}$               cross correlation function
$P_{xy}$               Fourier transform of $R_{xy}$
$\phi$                 coherency function
$k, l, n_x, n_y$       signed integer
$m, n$                 signed integer
$f_l, f_k$             frequency

Paper  presented  at  the  RTO  SET  Lecture  Series  on  "Advanced  Pattern  Recognition  Techniques", 
held  in  Bristol,  UK,  14-15  September  1998;  Rome,  Italy,  17-18  September  1998; 

Lisbon,  Portugal,  21-22  September  1998,  and  published  in  RTO  EN-2. 
2.2 A Simple Application of Pattern Matching

The detection of object positions in images is one
of the basic problems in image processing. In
this section the simple method of pattern matching
will be discussed.

A rectangular region $U$ is searched for in an image $A$.
$A$ and $U$ are matrices with the dimensions $r \times s$
and $m \times n$, such that $A \in G^{r \times s}$ and
$U \in G^{m \times n}$ with $r, s, m, n \in \mathbb{N}$.
$G$ is the set of gray values, $G = \{0, 1, 2, \ldots, 255\}$.
The matrix $U$ is nearly a copy of a submatrix $V$
in $A$, the searched object. Finding the position
of the submatrix $V$ is an equivalent formulation
of the problem. In preparation for the solution it
is assumed that $A$ (the digitized image) is a
discrete section of a normally distributed,
position-dependent, two-dimensional process (the real image).
With this assumption the cross correlation function
(CCF) can be used to detect the searched
position. If two shifted images are similar, the
CCF is maximized at the shift coordinates [1].


2.3 Mathematical formalism

The autocorrelation function (or sequence) of a stationary process $x(n)$ is defined by

$$R_{xx}(m) = E\{x(n) \cdot x(n+m)\} \qquad (1)$$

where $E\{\cdot\}$ denotes the ensemble expectation operator and $x(n)$ is real valued. The power spectrum is formally defined as

$$P_{xx}(f) = E\{X(f) \cdot X^{*}(f)\} \qquad (2)$$

where $X(f)$ is the Fourier transform of $x(n)$ with

$$X(f) = \sum_{m=-\infty}^{+\infty} x(m)\, e^{-j 2 \pi f m} \qquad (3)$$

According to the Wiener-Khinchin theorem the power spectrum is the Fourier transform (FT) of the autocorrelation function and may consequently be written as

$$P_{xx}(f) = \sum_{m=-\infty}^{+\infty} R_{xx}(m)\, e^{-j 2 \pi f m} \qquad (4)$$

Based on the basic definitions (1) and (4) the cross correlation function is defined as

$$R_{xy}(m) = E\{x(n) \cdot y(n+m)\} \qquad (5)$$

and the cross power spectrum as

$$P_{xy}(f) = \sum_{m=-\infty}^{+\infty} R_{xy}(m)\, e^{-j 2 \pi f m} \qquad (6)$$

if $x(n)$ and $y(n)$ are real valued. Equations (1) to (6) apply to one-dimensional processes.

The generalization for two-dimensional functions $x(m,n)$ and $y(m,n)$ is simply given by

$$R_{xy}(k,l) = E\{x(n,m) \cdot y(n+k, m+l)\} \qquad (7)$$

$$P_{xy}(f_k, f_l) = E\{X(f_k, f_l) \cdot Y^{*}(f_k, f_l)\} \qquad (8)$$

Finally the coherency function $\phi(f_k, f_l)$ is defined as

$$\phi(f_k, f_l) = \dots \qquad (9)$$

Equation (9) mainly contains information about the similarity of two two-dimensional functions or sequences. For instance, the coherency function of two identical but shifted two-dimensional functions $h(m,n)$ and $g(m,n)$ with

$$h(m,n) = g(m + m_x, n + n_y) \qquad (10)$$

is constant with the value one and leads to the identity

$$P_{gh} = P_{gg}\, e^{j 2 \pi (f_k m_x + f_l n_y)} \qquad (11)$$

The shift $(m_x, n_y)$ can then be calculated from the phase term. If the two functions are not similar, the coherency function is nearly zero. This means that there is a random relative phase distribution between the two functions [2]. If the functions $h(m,n)$ and $g(m,n)$ are stochastically distributed, without periodic components and with zero mean value, the function $R_{gg}(m,n)$ is globally maximized at the point (0,0). So the shift $(m_x, n_y)$ can be calculated simply with the inverse Fourier transform of equation (11),

$$R_{gh}(k,l) = R_{gg}(k + m_x, l + n_y) \qquad (12)$$

and with a maximum search on $R_{gh}(k,l)$ [3].
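The shift detection described by equations (7), (10) and (12) can be illustrated with a small sketch. It is written in Python rather than Matlab, replaces the ensemble expectation by a circular average over a small random test field, and uses a hypothetical shift `(mx, ny)`; none of these choices come from the original text.

```python
import random

def cross_correlation(x, y):
    """Circular estimate of R_xy(k, l) = E{x(n, m) * y(n + k, m + l)}, cf. equation (7)."""
    rows, cols = len(x), len(x[0])
    r = [[0.0] * cols for _ in range(rows)]
    for k in range(rows):
        for l in range(cols):
            acc = 0.0
            for n in range(rows):
                for m in range(cols):
                    acc += x[n][m] * y[(n + k) % rows][(m + l) % cols]
            r[k][l] = acc / (rows * cols)
    return r

def argmax2d(r):
    """Return the (k, l) index of the largest entry of r."""
    best = max((v, k, l) for k, row in enumerate(r) for l, v in enumerate(row))
    return best[1], best[2]

random.seed(0)
rows, cols = 8, 8
# g: zero-mean random field without periodic components (assumed test data)
g = [[random.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]
mx, ny = 3, 5  # hypothetical shift
# h(m, n) = g(m + m_x, n + n_y), cf. equation (10), with periodic continuation
h = [[g[(i + mx) % rows][(j + ny) % cols] for j in range(cols)] for i in range(rows)]

shift = argmax2d(cross_correlation(h, g))
```

Correlating `h` against `g` places the maximum at `(mx, ny)`. With the argument order of equation (12), `cross_correlation(g, h)`, the maximum appears at the negated shift modulo the field size.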


Figure 1: Satellite photography

Figure 2: Satellite photography - searched region

Figure 3: CCF

2.4 A simple application

With the assumptions given in section 2.2 a simple application will be shown. The image A is shown in Figure 1 and the searched region U in Figure 2.

Figures 1 and 2 are non-continuous, equidistantly sampled regions. In section 2.3 it has been shown that there are two ways to calculate the function $R_{AU}(k,l)$: equation (7) and the inverse Fourier transform of equation (8). Equation (7) is simple to evaluate on A and U. It is a non-causal convolution. With the assumption that the regions are periodically continued in both dimensions, and with a loss of accuracy (boundary effects are ignored), the inverse of equation (8) can be used. To fit the smaller region U to the dimensions of A, simple zero padding can be used, with $U'$ as the zero padded region U

$$U' = \begin{pmatrix} u_{1,1} & \cdots & u_{1,s} & 0 & \cdots & 0 \\ \vdots & & \vdots & \vdots & & \vdots \\ u_{r,1} & \cdots & u_{r,s} & 0 & \cdots & 0 \\ 0 & \cdots & 0 & 0 & \cdots & 0 \\ \vdots & & \vdots & \vdots & & \vdots \\ 0 & \cdots & 0 & 0 & \cdots & 0 \end{pmatrix} \qquad (13)$$

where $U'$ has the same dimensions as A. So the final equation can be written as

$$R_{AU}(k,l) = F^{-1}\{F(A - \bar{A}) \cdot F^{*}(U' - \bar{U}')\}. \qquad (14)$$

With equation (13) the shift coordinates between the regions U and V are $k = m_x$ and $l = n_y$, which maximize $R_{AU}(k,l)$.

2.5  Application  hints 

Equation (14) is easy to compute using the Matlab™ computation system. The following script (Matlab™ Version 5.2, Image Processing Toolbox) shows the computation of Figure 3.


The variable C represents $R_{AU}(k,l)$ and is theoretically real valued, but numerical errors of the computation system produce small imaginary values in C. The computation speed can be increased if m, n, r and s are powers of 2, so that the Discrete Fourier Transform (DFT) can be computed with the Fast Fourier Transform (FFT) [3].
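As an illustration of the frequency-domain route of equation (14), the following Python sketch locates a small region in a tiny test image. It is not the original Matlab script: the plain DFT helper `dft2`, the 6 x 6 test image and the patch values are assumptions made for this example, and the mean of U is removed before zero padding, as in the Matlab fragment for equation (7).

```python
import cmath

def dft2(a, inverse=False):
    """Plain 2-D DFT (no FFT speed-up); adequate for small matrices."""
    rows, cols = len(a), len(a[0])
    sign = 1 if inverse else -1
    out = [[0j] * cols for _ in range(rows)]
    for u in range(rows):
        for v in range(cols):
            acc = 0j
            for i in range(rows):
                for j in range(cols):
                    phase = 2j * cmath.pi * (u * i / rows + v * j / cols)
                    acc += a[i][j] * cmath.exp(sign * phase)
            out[u][v] = acc / (rows * cols) if inverse else acc
    return out

def remove_mean(a):
    mean = sum(map(sum, a)) / (len(a) * len(a[0]))
    return [[v - mean for v in row] for row in a]

def zero_pad(u, rows, cols):
    """Embed u in the upper-left corner of a rows x cols zero matrix."""
    padded = [[0.0] * cols for _ in range(rows)]
    for i, row in enumerate(u):
        for j, v in enumerate(row):
            padded[i][j] = v
    return padded

def locate(a, u):
    """Return (k, l) maximizing F^-1{F(A - mean) * conj(F(U' - mean))}."""
    rows, cols = len(a), len(a[0])
    fa = dft2(remove_mean(a))
    fu = dft2(zero_pad(remove_mean(u), rows, cols))
    prod = [[fa[i][j] * fu[i][j].conjugate() for j in range(cols)]
            for i in range(rows)]
    c = dft2(prod, inverse=True)
    # C is real in theory; small imaginary parts are numerical noise.
    best = max((c[k][l].real, k, l) for k in range(rows) for l in range(cols))
    return best[1], best[2]

# Hypothetical 6 x 6 test image containing the 2 x 2 patch at row 2, column 3.
patch = [[9.0, 1.0], [2.0, 8.0]]
image = [[0.0] * 6 for _ in range(6)]
for i in range(2):
    for j in range(2):
        image[2 + i][3 + j] = patch[i][j]

position = locate(image, patch)
```

The complex conjugate turns the product into a correlation instead of a convolution. Only the real part of the result is searched, which discards the small imaginary residue mentioned above.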
The most accurate, but slower, way to calculate $R_{AU}(k,l)$ is equation (7), which is shown in the next Matlab™ script.

[b1 map1] = imread('campgray.pcx');      % image A
[b2 map2] = imread('campgraypat.pcx');   % searched region U
A = double(b1) - mean(mean(b1));         % remove the mean value
U = double(b2) - mean(mean(b2));
C = conv2(A, flipud(fliplr(U)));         % correlation as convolution with the flipped U
[vx vy] = find(max(max(C)) == C);        % position of the CCF maximum


3 Classification of Surfaces by using Backscattering

Measuring the distribution of light scattered back from an illuminated surface is a fast and non-contact approach to characterizing surfaces [4], e.g. measuring surface roughness, detecting defects and contamination, and characterizing surface textures. The bidirectional reflectance distribution function (BRDF) can be used to describe the scattering of light from an illuminated surface. The BRDF is defined as the ratio of the scattered radiance $L_s$ at a particular solid angle $\Omega_s$ to the surface irradiance $E_i$,

$$\mathrm{BRDF} = \frac{L_s}{E_i} \approx \frac{P_s}{P_i \cos\theta_s\, \Omega_s} \qquad (15)$$

where $P_s$ is the scattered and $P_i$ the incident power.

As illustrated in Figure 4, the incident ray is defined by its incident angle $\theta_i$, and the scattered ray is defined by the azimuth angle $\phi_s$ (measured from the plane of incidence), the scatter angle $\theta_s$ and the solid angle $\Omega_s$.

Figure 4: Definition of BRDF.

Apart from a few limited scatter theories, pattern recognition methods must be employed to link the BRDF to surface properties.


3.1  Sample  data 

The angle-resolved BRDF data used in the following sections were taken from 11 surfaces with different microtopographies. Each surface represents one group. A group consists of two data sets with 50 observations each. One data set (recognition set) is used for the learning part of the classification and the other set (prediction set) is intended for testing the classification. Figure 5 shows the 11 group means. Only the scatter distribution in the plane of incidence ($\phi_s = 0$) is evaluated. The angle of incidence was fixed at 38°.


Figure  5:  Plot  of  group  means. 


3.2  Statistical  moments 

The advantage of using statistical moments as feature variables is that they are easy and fast to calculate. A drawback of statistical moments is that they only describe the statistical distribution of the BRDF, independently of the function parameters $\theta_s, \phi_s$.

The computed statistical moments of the recognition set are shown in Figure 6.

Figure 6: Plot of linear statistical moments vs. observations.

The statistical moments of groups 8, 9 and 10 do not show any significant difference; therefore classification errors are very likely for these groups.

3.3 Circular moments

To overcome the mentioned drawback of linear statistical moments, modern circular statistics can be employed to define circular trigonometric moments. Circular statistics is based on vectorial or axial data on the unit circle [5]. Therefore, in order to apply circular statistics to BRDF data, it is necessary to project the intensity values of the BRDF onto the unit circle. This can be done by interpreting the intensity value at a certain solid angle as a summation of elementary events, e.g. the number of photons reaching the detector. With this interpretation one can consider BRDF data as a circular distribution, as shown in Figure 7.

Figure 7: Interpretation of BRDF as a circular data distribution (left: BRDF, right: circular data distribution after the transformation); the triangles in the right figure symbolize the number of elementary events.


Mathematically, the transformation is given by the angular frequency value $n_j$,

$$n_j = \frac{\mathrm{BRDF}(\theta_j) - \min(\mathrm{BRDF})}{\max(\mathrm{BRDF}) - \min(\mathrm{BRDF})} \cdot n_e \qquad (16)$$

where $\theta_j$ is the j-th scattering angle $\theta_s$ ($j = 1, \dots, N_{\theta_s}$), min(BRDF) and max(BRDF) are the minimum and maximum values of the BRDF data, respectively, $n_e$ is the number of elementary events the intensity value of the BRDF is divided into, and $n_s$ is the number of elementary events in the sample data.

Now the sample mean direction $\bar{\theta}$ can be calculated

$$C = \sum_j n_j \cos\theta_j, \qquad S = \sum_j n_j \sin\theta_j, \qquad R^2 = C^2 + S^2$$

$$\bar{\theta} = \begin{cases} \tan^{-1}(S/C) & S > 0,\ C > 0 \\ \tan^{-1}(S/C) + \pi & C < 0 \\ \tan^{-1}(S/C) + 2\pi & S < 0,\ C > 0 \end{cases} \qquad (17)$$

which should be equal to the angle of incidence $\theta_i$ for a specular BRDF. By defining the mean resultant length $\bar{R} \in (0,1)$, the sample circular variance $V \in (0,1)$ and the sample circular standard deviation $v$ can be calculated.

$$\bar{R} = R / n_s, \qquad V = 1 - \bar{R}, \qquad v = \sqrt{-2 \log(1 - V)} \qquad (18)$$

From the centered sample trigonometric moments $m_p$

$$m_p = \frac{1}{n_s} \sum_{j=1}^{N_{\theta_s}} n_j \cos p(\theta_j - \bar{\theta}) \qquad (19)$$

the sample circular dispersion $\delta$ (a measure of spread)

$$\delta = \frac{1 - m_2}{2 \bar{R}^2} \qquad (20)$$

and measures of skewness s and kurtosis k can be defined [5].

$$s = \frac{m_2 \sin(p_2 - 2\bar{\theta})}{(1 - \bar{R})^{3/2}}, \qquad k = \frac{m_2 \cos(p_2 - 2\bar{\theta}) - \bar{R}^4}{(1 - \bar{R})^2} \qquad (21)$$

The value $p_2$ is calculated from the second uncentered sample trigonometric moment

$$C_p = \frac{1}{n_s} \sum_j n_j \cos(p\,\theta_j), \qquad S_p = \frac{1}{n_s} \sum_j n_j \sin(p\,\theta_j)$$

$$p_2 = \begin{cases} \tan^{-1}(S_2/C_2) & S_2 > 0,\ C_2 > 0 \\ \tan^{-1}(S_2/C_2) + \pi & C_2 < 0 \\ \tan^{-1}(S_2/C_2) + 2\pi & S_2 < 0,\ C_2 > 0 \end{cases} \qquad (22)$$

The computed circular moments of the recognition set are shown in Figure 8.

As one can see from the feature plot, the circular moments of groups 8 and 9 do not show any significant difference; therefore classification errors are very likely for these groups.
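The circular moments above can be computed along the following lines. This is a minimal Python sketch under stated assumptions: the scaling constant `n_e`, the choice of `n_s` as the sum of all `n_j`, and the synthetic Gaussian-shaped lobe centered at 38° are illustrative and not taken from the measured data. The three-case definitions of the mean direction and of `p2` are replaced by the equivalent `atan2(...) mod 2*pi`.

```python
import math

def circular_moments(theta, brdf, n_e=1000.0):
    """theta: scattering angles [rad]; brdf: intensities at these angles."""
    lo, hi = min(brdf), max(brdf)
    # Angular frequency values n_j, cf. equation (16); n_e is an assumed constant.
    n = [(b - lo) / (hi - lo) * n_e for b in brdf]
    n_s = sum(n)  # assumption: total number of elementary events
    c = sum(nj * math.cos(t) for nj, t in zip(n, theta))
    s = sum(nj * math.sin(t) for nj, t in zip(n, theta))
    mean_dir = math.atan2(s, c) % (2.0 * math.pi)        # equation (17)
    r_bar = math.hypot(c, s) / n_s                       # equation (18)
    variance = 1.0 - r_bar
    std_dev = math.sqrt(-2.0 * math.log(1.0 - variance))
    # Second centered moment, equation (19) with p = 2
    m2 = sum(nj * math.cos(2.0 * (t - mean_dir)) for nj, t in zip(n, theta)) / n_s
    # Direction of the second uncentered moment, equation (22)
    c2 = sum(nj * math.cos(2.0 * t) for nj, t in zip(n, theta)) / n_s
    s2 = sum(nj * math.sin(2.0 * t) for nj, t in zip(n, theta)) / n_s
    p2 = math.atan2(s2, c2) % (2.0 * math.pi)
    return {
        "mean_direction": mean_dir,
        "variance": variance,
        "std_dev": std_dev,
        "dispersion": (1.0 - m2) / (2.0 * r_bar ** 2),   # equation (20)
        "skewness": m2 * math.sin(p2 - 2.0 * mean_dir) / (1.0 - r_bar) ** 1.5,
        "kurtosis": (m2 * math.cos(p2 - 2.0 * mean_dir) - r_bar ** 4)
                    / (1.0 - r_bar) ** 2,
    }

# Synthetic, symmetric specular lobe around 38 degrees (illustrative data only).
degrees = list(range(8, 69))
theta = [math.radians(d) for d in degrees]
brdf = [math.exp(-((d - 38) / 5.0) ** 2) for d in degrees]
result = circular_moments(theta, brdf)
```

For this symmetric lobe the mean direction coincides with the lobe center and the skewness vanishes, in line with the statement that the mean direction should equal the angle of incidence for a specular BRDF.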
3.4 Features based on BRDF models

Another approach to feature extraction from BRDF data is based on fitting BRDF models and using the estimated parameters as feature variables. Quite a lot of BRDF models have been suggested by the computer graphics community. The definition of two models will be given. One of the earlier and still quite popular models is the Phong model [6]. A slightly modified model was introduced by [7] and is defined as a sum of a diffuse part and a specular part.

$$f_1(\theta_s, \theta_i) = a_1 + a_2 \cos^{a_3}(\theta_s - \theta_i) \qquad (23)$$

The diffuse part depends on $a_1$ and the specular part depends on $a_2$ and $a_3$. Two examples of the Phong approximation of a diffuse and a specular BRDF are shown in Figure 9.

The least squares approximation was done with the quasi-Newton algorithm. The computed feature variables $(a_1, a_2, a_3)$ of the recognition set are shown in Figure 10.

Figure 10: Plot of coefficients of Phong model vs. observations.

The model of Meister [8] allows a more accurate description of the diffuse part and the specular part. It is defined as a summation of a constant diffuse part, an angle dependent diffuse part and a specular part.

$$f_2(\theta_s, \theta_i) = a_0 + a_1(\theta_i^2 + \theta_s^2) + \dots + a_3 \theta_i \theta_s \cos(\phi_s) + \dots \qquad (24)$$

The coefficient $a_0$ gives the constant diffuse part of the sample. The angle dependent diffuse part is modeled by the coefficients $a_1, a_2, a_3$; the coefficients $a_4, a_5$ describe the amplitude and $a_6$ stands for the width of the specular part. According to the model the coefficients $a_0$, $a_4$, $a_5$ and $a_6$ should be positive. Figure 11 shows two approximations with the Meister model.

The least squares approximation was done with the quasi-Newton algorithm. The computed feature variables $(a_1, \dots, a_6)$ of the recognition set are shown in Figure 12.
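To make the use of model parameters as features concrete, the following Python sketch fits the three Phong coefficients of equation (23) to sampled data. It does not reproduce the quasi-Newton optimization mentioned above: as a simpler stand-in it scans a list of candidate exponents `a3` and solves the remaining linear least squares problem for `a1` and `a2` in closed form. The function names, the exponent grid and the synthetic data are assumptions.

```python
import math

def phong(theta_s, theta_i, a1, a2, a3):
    """Modified Phong model: diffuse part a1 plus specular lobe a2*cos^a3."""
    return a1 + a2 * math.cos(theta_s - theta_i) ** a3

def fit_phong(theta_s, values, theta_i, exponents):
    """Grid search over a3; a1 and a2 follow from the 2x2 normal equations."""
    best = None
    n = len(values)
    for a3 in exponents:
        g = [math.cos(t - theta_i) ** a3 for t in theta_s]
        sg, sgg = sum(g), sum(x * x for x in g)
        sy, sgy = sum(values), sum(x * y for x, y in zip(g, values))
        det = n * sgg - sg * sg
        if abs(det) < 1e-12:
            continue  # degenerate basis, skip this exponent
        a2 = (n * sgy - sg * sy) / det
        a1 = (sy - a2 * sg) / n
        err = sum((a1 + a2 * x - y) ** 2 for x, y in zip(g, values))
        if best is None or err < best[0]:
            best = (err, a1, a2, a3)
    return (best[1], best[2], best[3]), best[0]

# Synthetic data generated from known coefficients (illustrative only).
theta_i = math.radians(38.0)
theta_s = [math.radians(d) for d in range(-40, 81, 2)]
values = [phong(t, theta_i, 0.2, 1.5, 20) for t in theta_s]
(a1, a2, a3), error = fit_phong(theta_s, values, theta_i, range(1, 51))
```

The returned triple `(a1, a2, a3)` plays the role of the feature vector. The scan is only practical because the model is linear in two of its three parameters; the Meister model would need a general nonlinear optimizer.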
Figure 11: Plot of two approximations with the Meister model: left, of a specular BRDF; right, of a smooth BRDF.

Figure 12: Plot of coefficients of Meister model vs. observations.


3.5 Features based on spline approximation

In the last section the BRDF was approximated by a certain function model. If the shape of the approximation function is not known, spline curves can be employed to approximate the light distribution. For this example the B-spline form is employed [9]. The basis functions of a B-spline curve are called B-spline functions. A B-spline function of order k with the ordered knot sequence T (the parameter values of the B-spline function)

$$T = (t_0, t_1, t_2, \dots, t_{n+k-1}, t_{n+k}), \qquad t_0 \le t_1 \le \dots \le t_{n+k} \qquad (25)$$

can be defined by a recurrence relation, as follows:

$$\text{if } k = 1: \quad N_{i,1}(t) = \begin{cases} 1 & \text{if } t_i \le t < t_{i+1} \\ 0 & \text{else} \end{cases}$$

$$\text{if } k > 1: \quad N_{i,k}(t) = \frac{t - t_i}{t_{i+k-1} - t_i}\, N_{i,k-1}(t) + \frac{t_{i+k} - t}{t_{i+k} - t_{i+1}}\, N_{i+1,k-1}(t) \qquad (26)$$

A B-spline curve of order k is given as a linear combination of the B-spline functions $N_{i,k}$.

$$B_k(t) = \sum_{i=1}^{n} a_i N_{i,k}(t) \qquad (27)$$

The coefficients $a_i$ are called the control points or de Boor points of the curve. The smoothness of the B-spline curve can be controlled by the knot sequence T, e.g. for a jump in the first derivative at $t_3$ one would repeat the knot value $t_3$ $k - 1$ times in the knot sequence. This B-spline property can be used to improve the approximation of BRDFs with a strong specular part.

The approximation of a given set of points $P_i$ (i.e. a measured BRDF) can be expressed as the minimization of the sum of the squared errors $D_i$.

$$\sum_{i=0} D_i = \sum_{i=0} \lVert B_k(t_i) - P_i \rVert^2 \to \min \qquad (28)$$

Partial differentiation with respect to the $a_i$ leads to a linear system of equations. Solving this system of equations gives the desired control points $a_i$. Two examples of B-spline approximations of differently shaped light distributions are shown in Figure 13.

The control points $a_i$ of the approximated B-spline curve can be used as feature variables when the knot sequence T and the order k are constant for the data set. The computed feature variables of the learning set, based on a B-spline curve of order four with six intervals and one possible jump in the first derivative at $\theta_s = \theta_i$, are shown in Figure 14.
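The recurrence relation for the B-spline functions translates almost literally into code. The sketch below is a straightforward recursive Python version with zero-based indices; the convention that a term with a zero denominator is dropped, and the clamped example knot sequence with three intervals, are assumptions made for the illustration and not the configuration used for the BRDF data.

```python
def bspline_basis(i, k, t, knots):
    """B-spline function N_{i,k}(t) of order k (degree k - 1), zero-based i."""
    if k == 1:
        return 1.0 if knots[i] <= t < knots[i + 1] else 0.0
    left = right = 0.0
    d1 = knots[i + k - 1] - knots[i]
    if d1 > 0:  # skip terms with repeated knots (0/0 treated as 0)
        left = (t - knots[i]) / d1 * bspline_basis(i, k - 1, t, knots)
    d2 = knots[i + k] - knots[i + 1]
    if d2 > 0:
        right = (knots[i + k] - t) / d2 * bspline_basis(i + 1, k - 1, t, knots)
    return left + right

def bspline_curve(t, coeffs, k, knots):
    """Linear combination of the basis functions with control points coeffs."""
    return sum(a * bspline_basis(i, k, t, knots) for i, a in enumerate(coeffs))

# Order four (cubic), clamped knot sequence with three intervals: 6 basis functions.
order = 4
knots = [0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 3.0, 3.0, 3.0, 3.0]
n_basis = len(knots) - order
partition = sum(bspline_basis(i, order, 1.5, knots) for i in range(n_basis))
constant = bspline_curve(1.5, [2.0] * n_basis, order, knots)
```

Inside the knot range the basis functions sum to one, so a curve whose control points are all equal reproduces that constant. For a fit according to equation (28), these basis values form the rows of the linear system for the control points.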
Figure 13: Two approximations with fourth order B-splines and six intervals: left, with a jump in the first derivative at $t_3$; right, with continuous derivatives.

Figure 14: Plot of coefficients of B-spline approximation vs. observations.


3.6 Classification results

Now the ability of the defined BRDF features to significantly characterize a class of BRDFs is to be investigated. A first impression of the quality of the approximation based features can be obtained by looking at the approximation errors. By expressing the approximation error as follows

$$TS_{\chi^2} = \dots \qquad (29)$$

the $\chi^2$ statistical test can be used to check the quality of the fit. If the value $TS_{\chi^2}$ is larger than the corresponding $\chi^2$ quantile value, the fit should be considered non-significant. The degree of freedom for the $\chi^2$ quantile is given by the number of approximated data points minus the number of free parameters. Figure 15 shows the $TS_{\chi^2}$ values for the approximations applied to the recognition set. The straight lines symbolize the corresponding $\chi^2$ quantile values. From the plot in Figure 15 it can be stated that the B-spline approximation gives the best result with respect to the approximation errors. Apart from the observations of groups two and four, the $TS_{\chi^2}$ values of the B-spline and the Meister approximations are below the quantile values. Therefore they can be considered significant according to the $\chi^2$ test. However, the real discrimination performance of the features can only be assessed by evaluating the classification results. Table 1 shows the summarized classification results for different classification methods. For each algorithm the classification results of the recognition set and the test set are given. The first three algorithms are based on neural networks with the following start configurations:

• The perceptron net was initialized with 11 neurons in the output layer.

• The feedforward net was initialized as a two layer configuration with the number of neurons in the input layer equal to the number of feature variables and 11 neurons in the output layer. The position of the maximum activation in the output layer is taken as the classification result.

• The radial basis function network was initialized in the same manner as the feedforward network.
Figure 15: Plot of $TS_{\chi^2}$ approximation errors vs. observations.

Table 1: Classification results (correct classifications in percent).

| classification algorithm | data set | empirical moments | circular moments | Phong model coefficients | Meister model coefficients | B-spline coefficients | all features |
|---|---|---|---|---|---|---|---|
| Perceptron | recognition set | 75.3% | 30.0% | 22.2% | 52.2% | 90.2% | 80.4% |
| Perceptron | prediction set | 73.3% | 26.7% | 22.5% | 39.6% | 82.9% | 62.4% |
| Feed Forward | recognition set | 92.4% | 82.9% | 62.5% | 78.9% | 94.7% | 93.3% |
| Feed Forward | prediction set | 95.0% | 86.5% | 73.8% | 93.6% | 98.6% | 99.3% |
| Radial Basis Function | recognition set | 100.0% | 98.4% | 100.0% | 100.0% | 100.0% | 100.0% |
| Radial Basis Function | prediction set | 82.2% | 15.9% | 51.5% | 10.3% | 44.4% | 10.3% |
| Discra | recognition set | 98.4% | 86.6% | 90.4% | 81.8% | 99.5% | 99.8% |
| Discra | prediction set | 99.1% | 81.8% | 88.4% | 74.9% | 97.7% | 99.8% |
The following conclusions can be drawn from the classification results:

• The discriminant analysis and the feedforward neural network work best with the introduced features; the discriminant analysis is slightly better, especially for weaker features.

• The classification performance of the perceptron neural network is not sufficient.

• The radial basis function network gives a very good classification result for the learning step, but for the test set the classification performance is very poor (see Table 1). This might be improved by changing the network configuration, e.g. by including additional hidden layers.

• The empirical statistical moments and the B-spline coefficients give the best classification results, independent of the classification algorithm used.

• The Phong and Meister approximation based features do not show the desired discriminatory power. A possible cause is that the nonlinear optimization might lead to different parameter configurations for almost identical BRDFs. This could be improved by applying better start parameters and by checking the parameter configuration for validity with respect to the BRDF model during the optimization process.

4  Recognition  of  Objects  in  3D 
Laser  Radar  Images 

4.1  Basics  of  Laser  Range  Gating 

There are different possibilities to generate 3D images at long range (> 100 m) with a pulsed laser:

• a scanning camera system that combines a scanning system (rotating mirror) with a single detector (avalanche photodiode, APD) [10],

• an APD array receiver [11, 12],

• a gated viewing camera [13, 14].

These systems could be used in automatic target recognition, aimpoint selection, target tracking and obstacle avoidance [15, 16, 17, 18].


What we will mainly examine here are imaging laser radar systems based on a gated viewing camera. Figure 16 demonstrates the principle of laser range gating. The scene is illuminated with a pulsed laser that is synchronized to a gated viewing camera. The delay of the camera exposure and the duration of the exposure can be controlled with an accuracy of 1 ns. It is possible to achieve exposure times $\Delta t_{exp}$ of 3 ns. With

$$\Delta R = (\Delta t_{exp} + \Delta t_{puls}) \cdot \frac{c}{2} \qquad (30)$$

where c is the velocity of light and $\Delta t_{puls}$ is the pulse width of the laser, it can be seen that the camera frame includes only the scene at a given range with a depth $\Delta R$ of 1.5 m for a pulse width $\Delta t_{puls}$ of 7 ns. The very fast shutter speed of 3 ns can be achieved with an MCP (Micro Channel Plate), Generation II. The frame is generated only from the photons reflected from the target slice T5 (Figure 16). The photons of the light scattered outside this target slice, i.e. in the slices T1-T4 and T6-T9 (Figure 16), do not reach the camera within the exposure time and will therefore not be added to the image.
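The quoted slice depth can be checked numerically. The short Python sketch below assumes that the range depth is the sum of exposure time and pulse width multiplied by c/2, which reproduces the 1.5 m given in the text; the function name is illustrative.

```python
C_LIGHT = 299_792_458.0  # velocity of light in m/s

def gate_depth(t_exp, t_pulse):
    """Depth of the imaged range slice in metres for gate and pulse durations in seconds."""
    return (t_exp + t_pulse) * C_LIGHT / 2.0

depth = gate_depth(3e-9, 7e-9)  # 3 ns exposure, 7 ns laser pulse
```

With these values the depth evaluates to roughly 1.5 m, so a shorter gate or a shorter laser pulse directly gives a thinner range slice.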

The timing can be controlled at every laser shot, and consequently one gets illuminated slices at different ranges with different depths. The combination of all slices gives a 3D image. Normally, however, it is not useful to combine the range gated images, because an image processing system can extract more information, and faster, from the single range gated images.

4.2  Eye-safety 

The laser radar system mentioned in section 4.1 has one big disadvantage: it operates in the visible region at a wavelength of 532 nm. For an "all weather" long range imaging laser radar with the requirement of high laser power it is useless due to the extreme eye hazard near the maximum sensitivity of the human eye. These laser radars working with an MCP are nowadays only practical for underwater applications like mine detection or the control of autonomous underwater robots.

Achieving more laser power without an eye hazard is only possible beyond a wavelength of 1.4 µm. In this spectral region there is hardly any transmission between the cornea and the retina, so that the incoming light is not focused on the retina. The laser power limits beyond 1.4 µm are nearly the same for the eye and the skin. Providing a class 1 laser radar system means not exceeding the AEL (Accessible Emission Limit) outside the transmitter optics. A class 1 system can operate without any safety rules [19].

For laser radar systems the AEL depends on the number of pulses N that could reach the cornea within a time period of 100 s:

$$AEL_{multipulse} = N^{-1/4} \cdot AEL_{singlepulse} \qquad (31)$$

Table 2 shows that the permitted laser power in the eye-safe spectral region beyond 1.4 µm can be much higher than in the visible.
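The multipulse correction can be verified against the entries of Table 2. The Python sketch below assumes the correction factor $N^{-1/4}$ and derives the peak power limit by dividing the multipulse energy limit by the pulse width; the function names are illustrative, the numerical inputs (3000 pulses in 100 s, 9 ns pulse width, single pulse limits) are those of Table 2.

```python
def ael_multipulse(ael_single, n_pulses):
    """Multipulse AEL in joules from the single pulse AEL and the pulse count N."""
    return n_pulses ** -0.25 * ael_single

def peak_power_limit(ael_multi, pulse_width):
    """Permitted peak power in watts for a pulse of the given width in seconds."""
    return ael_multi / pulse_width

n_pulses = 3000       # 30 Hz for 100 s
pulse_width = 9e-9    # 9 ns

visible = ael_multipulse(2e-7, n_pulses)    # 532 nm
eye_safe = ael_multipulse(8e-3, n_pulses)   # 1574 nm
ratio = peak_power_limit(eye_safe, pulse_width) / peak_power_limit(visible, pulse_width)
```

Because the correction factor is the same for all three lasers, the ratio of the permitted peak powers equals the ratio of the single pulse limits, a factor of 40000 between 1574 nm and 532 nm.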

For an "all weather" long range imaging laser radar system it is absolutely necessary that it operates in the eye-safe region. High power laser sources like Raman shifted Nd-YAG, Nd-YAG OPO (Optical Parametric Oscillator) or erbium doped fiber lasers are available on the market, but imaging receivers working beyond 1.1 µm are very rare.

4.3  Design  of  an  eye-safe,  imaging 
laser  radar 

The development of an eye-safe, imaging, scannerless laser radar based on gated viewing requires another shutter principle, because the use of an MCP in the eye-safe region [20] does not seem realistic in the near future [21]. The requirements for such a system are:

• a high power eye-safe laser source for single shot illumination,

• a highly sensitive detector at a wavelength of 1.5 µm,

• a very fast shutter in the nanosecond region,

• a precise delay generator with a stability in the picosecond region.

The whole system design is shown in Figure 17. We built a complete computer controllable system including a BigSky Nd:YAG laser CFR 400 OPO, an InGaAs camera SU128-1.7RT CCIR from Sensors Unlimited [22], a delay generator EG&G model 9650A and an electro-optical modulator (EOM) from Conoptics.

The InGaAs camera has a quantum efficiency of 68% at a wavelength of 1.5 µm and a detectivity $D^*$ of $10^{\dots}$. The camera works at a video rate of 25 Hz and triggers the laser. At this rate the laser achieves a pulse energy of 60 mJ and a pulse width of 7 ns. The pump energy of the flashlamp pumped laser can be controlled via RS232, so that it is possible to increase or decrease the laser output energy at every laser shot. The Q-switch sync output is used to get the time of the laser start and to trigger the delay generator, which receives the delay time and pulse duration via RS232. The electrical output pulse with the given delay and pulse width opens the EOM shutter for a duration between 18 ns and 500 ns, corresponding to the input pulse width. Opening the EOM shutter means that the dc voltage level at the variable dc voltage supply provides the half wave voltage. The EOM shutter we use here is of the transverse field type with the material lithium tantalate. For the EOM shutter we have calculated and designed a special objective. It includes an analyzer, a polarizer and an IR bandpass filter with a FWHM optical bandwidth of 30 nm. The focal length of the objective is 400 mm.

The whole system design integrates an eye-safe rangefinder working with an InGaAs APD as detector, which communicates with the control computer via a Centronics interface. The rangefinder does not work with the last pulse technique, but we can set a range gate of any size and manage to get the range even in bad weather conditions such as heavy rain or fog. The laser radar control software can use the range information from a target to set the corresponding delay of the delay generator. The observation of the target with the range gating system can then start at this location.

We have finished the system design, but it has not yet been possible to provide outdoor experimental results. Therefore, for the example of object recognition and classification in section 4.5, we use a range gating sequence recorded with a gated viewing system including an MCP.

4.4 Long range experiments with a gated viewing system at the wavelength of 532nm

The target used for the long range experiments was a truck (Figure 18(a)) with a special wire rack (Figure 18(b)). These two images were taken with a Sony CCD-TRV101E video camera. The truck, painted in olive green drab, stands at a distance of 740 m from the measurement house. It is nearly a Lambertian target with a reflectivity $\rho = 0.1$. The wire rack consists of a wooden frame including the following crossbars from top to bottom:

| type | diameter [mm] |
|---|---|
| copper wire | 3.3 |
| cable | 6 |
| cable | 13 |
| aluminum pipe | 20 |
| aluminum pipe | 30 |
| aluminum pipe | 60 |

Figure 19 shows a sequence of range gated images. Every image in the sequence is the mean value image of 20 single range gated images with the same delay and duration of the gate. The whole sequence consists of 8 range slices corresponding to the 8 range values (grey scale bar in the range image, Figure 20(b)). Because the images were taken at 25 Hz, it took 6.4 s to acquire the sequence and the resulting range image, Figure 20(b).

Only 6 of the 8 images are presented in Figure 19, because the range slices starting at 0.58 km and 1.47 km do not contain information. The duration of the gate was 993 ns for every image slice, which with (30) means a range depth $\Delta R$ of 150 m. Table 3 displays the corresponding system parameters for the image sequence of Figure 19. There are some discernible peculiarities in the image sequence. In the first image of the sequence the plants are blurred, because the focus is set to the truck at a range of 740 m. Consequently the visible shadow of the plants in the second image is sharp. The solution to this problem is the integration of a focus control, or image processing such as image restoration with a Wiener filter. The next peculiarity can be seen in Figure 19(e). The image is strongly overexposed at the top left, which causes scattered light in the objective. The reason for this is a retroreflector in the middle of the target. But the existence of such a hot spot illustrates the effectiveness of range gating: the range slice of the truck is not influenced by the hot spot. Consequently, the truck can be detected clearly. The different range slices are combined into a range image. The range image in Figure 20(b) is calculated with a simple threshold of 20. Every grey value corresponds to a range, as can be seen in the grey scale bar on the right side of the image. The grey value 0 (black) at a location (pixel (x,y)) is used if there is no grey value in the sequence higher than the threshold. The choice of the threshold depends on the maximum noise value after building the mean value image.
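The combination of range slices into a range image can be sketched as follows. The text specifies the threshold and the use of grey value 0 where no slice exceeds it; how a pixel is resolved when several slices exceed the threshold is not stated, so this Python sketch assumes that the brightest slice wins. Slice indices 1..8 stand in for the grey values of the range scale, and all names and the tiny test data are illustrative.

```python
def range_image(slices, threshold=20):
    """slices: mean value images (2-D lists) ordered by increasing range.

    Returns an image holding, per pixel, the 1-based index of the brightest
    slice above the threshold, or 0 if no slice exceeds the threshold.
    """
    rows, cols = len(slices[0]), len(slices[0][0])
    out = [[0] * cols for _ in range(rows)]
    for y in range(rows):
        for x in range(cols):
            best_value, best_index = threshold, 0
            for index, img in enumerate(slices, start=1):
                if img[y][x] > best_value:  # assumption: brightest slice wins
                    best_value, best_index = img[y][x], index
            out[y][x] = best_index
    return out

def acquisition_time(n_slices, images_per_slice, frame_rate_hz):
    """Time in seconds needed to record all single images of a sequence."""
    return n_slices * images_per_slice / frame_rate_hz

# Two hypothetical 2 x 2 range slices
near = [[5, 60], [0, 0]]
far = [[25, 40], [10, 21]]
combined = range_image([near, far])
duration = acquisition_time(8, 20, 25)
```

The second helper reproduces the acquisition time quoted in the text: 8 slices of 20 single images each at 25 Hz take 6.4 s.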

For the computer controllable system it is important to consider the enormous influence of the chosen MCP gain on the image noise. A high gain produces very strong excess noise, as can be seen in Figure 20(a). This is a single image corresponding to the mean value image (20 single images) of Figure 19(b). The system works here with the maximum MCP gain of 18000.

4.5  Example  of  Object  Recognition 
with  a  Laser  Radar  Range  Gated 
Sequence 

4.5.1  Introduction 

In the example it will be shown that object recognition in a laser radar image sequence has many advantages in comparison to normal grey value images. In many cases automatic object detection, the classification of these objects and their assignment to different classes is only possible when range gating is used.

The nature of range gating is that only a particular object is illuminated while the background and the foreground are both black. For digital image processing this means that the first step of an object recognition algorithm, object segmentation, can be done by simple thresholding. Looking at the image sequence in Figure 19, one can imagine that a normal grey value image of this scene would not allow the truck to be detected, because the grey values of the bush in the foreground and the forest in the background cover nearly the same range.

The next big advantage is the knowledge of the distance of a range slice. With the distance and the main parameters of the objective and the camera it is no problem to calculate the real size of an object. The same target at different ranges always has a different size in pixels in the image. In the range gated images the correct size of the target in meters can be calculated, and therefore for an object recognition algorithm the object size is range independent.

The next subsections discuss in detail the necessary steps of an object
recognition algorithm for the automatic detection of the truck in the
range gating sequence of section 4.4. The same algorithm should also be
capable of distinguishing different types of trucks, tanks, helicopters
or different weapon systems. The object features used here are only a
small selection of the possible features. Especially for the
classification of geometrically similar objects, the signature moments
could be a promising choice. The example does not use the object
signature.
Figure 16: Principle of laser range gating (figure labels: rain, snow, fog, haze)


laser type              frequency-doubled Nd-YAG   Nd-YAG          Nd-YAG OPO
wavelength [nm]         532                        1064            1574
pulse width Δt [ns]     9                          9               9
pulse frequency [Hz]    30                         30              30
pulses N in 100 s       3000                       3000            3000

CLASS 1 LIMITS
AEL single pulse [J]    2 * 10^-7                  2 * 10^-6       8 * 10^-3
AEL multipulse [J]      2.7 * 10^-8                2.7 * 10^-7     1.1 * 10^-3
laser peak power [W]    < 3                        < 30            < 120000

Table 2: Comparison of the class 1 AEL (Accessible Emission Limit) for different Nd-YAG lasers
image number   range   gate delay   gate duration   pulse energy   maximum
(Figure 19)    [km]    [ns]         [ns]            [mJ]           grey value
(a)            0.43    2721         993             90             70
(b)            0.73    4705         993             90             224
(c)            0.88    5697         993             90             47
(d)            1.02    6689         993             90             52
(e)            1.17    7681         993             90             255
(f)            1.32    8673         993             90             97

Table 3: System parameters for the image sequence of Figure 19
Figure 18: Video image of the truck target (a) and the special wire rack (b)

Figure 19: Sequence of range gated images at different ranges from 0.43 km (a) to 1.32 km (f)

Figure 20: (a) Single range gated image (false color), high gain of 18000 and therefore a lot of excess noise; (b) Range image calculated with a threshold of 20

4.5.2  Preprocessing and Segmentation

Before executing the segmentation step it is always advisable to
concentrate on preprocessing. For laser radar image sequences there
are many different possibilities to enhance the quality of the single
images of the sequence. Laser radar systems which work with an MCP
often generate a lot of excess noise, as can be seen in Figure 20. The
excess noise depends on the MCP gain, which in turn depends mainly on
the weather, the chosen range and the maximum possible laser output
power. The system parameters dictate a particular gain, and therefore
we have to live with the noise.

The literature mentions many different noise removal filters which
operate on one single image. In our case there is a much easier and
more effective way. Because the process of image capturing can be
controlled, we can take several images of the same range slice. There
are then three ways to combine the images:

•  geometric averaging,

•  median averaging or

•  robust averaging.

In the example we use the geometric averaging of 20 single images for
every range slice. Robust averaging would be more effective if the aim
is to minimize the number of images or if there are outliers in the
rows or in a whole image.

Normally the segmentation of objects is a very simple step in range
gating sequences, namely a simple thresholding. The threshold depends
only on the system noise and on the noise removal filter or the image
averaging process. Since the threshold is known, every pixel with a
grey value above the threshold is an object pixel. The threshold value
in the example is fixed to 22.
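
The averaging and thresholding steps can be illustrated with a small sketch. It is not the implementation used for the figures: the frames are hypothetical 2x3 grey-value arrays, the function names are made up for illustration, and the averaging of the text is read here as a pixelwise mean (matching the "mean value image" mentioned earlier), which is an assumption. The median variant is shown next to it because it suppresses a single outlier frame.

```python
from statistics import mean, median

def combine_frames(frames, reducer=mean):
    """Combine equally sized grey-value frames pixel by pixel."""
    rows, cols = len(frames[0]), len(frames[0][0])
    return [[reducer([frame[r][c] for frame in frames]) for c in range(cols)]
            for r in range(rows)]

def segment(image, threshold):
    """Binary image: 1 where the grey value exceeds the threshold."""
    return [[1 if value > threshold else 0 for value in row] for row in image]

# Three hypothetical frames of the same range slice; the third frame
# carries one noise outlier (200) in the top-left pixel.
frames = [
    [[10, 60, 12], [8, 70, 9]],
    [[14, 64, 10], [12, 66, 11]],
    [[200, 62, 11], [10, 68, 10]],
]

THRESHOLD = 22  # threshold value quoted in the text

mask_mean = segment(combine_frames(frames, mean), THRESHOLD)
mask_median = segment(combine_frames(frames, median), THRESHOLD)

print(mask_mean)    # the outlier survives the mean and becomes an object pixel
print(mask_median)  # the median rejects the outlier
```

With only three frames the outlier dominates the mean; with the 20 frames per slice used in the example its influence would be correspondingly smaller.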

The correct choice of the range slice depth $\Delta R$ (Eq. 30) in
combination with the depth $\Delta TR$ of the target to be detected is
important for minimizing the foreground and background pixels that are
connected with the pixels of the target. If the distance of the target
is unknown and the range slice depth is nearly the same as the target
depth, the target could be split between two different image slices.
To be sure that one image slice of the range gated sequence includes
the whole target, the range slice depth $\Delta R$ has to be twice the
target depth $\Delta TR$ and the range slices have to overlap.
Therefore the exposure time $\Delta t_{exp}$ is calculated as

$$\Delta t_{exp} = \frac{2\,\Delta R}{c} - \Delta t_{puls} \qquad (32)$$

and the exposure delay step $\Delta t_{del}$ has to be fixed to

$$\Delta t_{del} = \frac{2\,\Delta TR}{c} - \Delta t_{puls}. \qquad (33)$$
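
As a rough numerical illustration of the gate timing, the sketch below assumes that the exposure time is the two-way travel time across the slice depth minus the pulse width, and that the delay step is the two-way travel time across the target depth minus the pulse width. The target depth of 10 m and the function names are hypothetical; the 9 ns pulse width is the value listed in Table 2.

```python
C_LIGHT = 299_792_458.0  # speed of light in m/s

def exposure_time(slice_depth_m, pulse_width_s):
    """Gate (exposure) time for a range slice of the given depth."""
    return 2.0 * slice_depth_m / C_LIGHT - pulse_width_s

def delay_step(target_depth_m, pulse_width_s):
    """Delay increment between neighbouring, overlapping range slices."""
    return 2.0 * target_depth_m / C_LIGHT - pulse_width_s

target_depth = 10.0                 # hypothetical target depth in m
pulse_width = 9e-9                  # 9 ns pulse width, as in Table 2
slice_depth = 2.0 * target_depth    # slice twice as deep as the target

t_exp = exposure_time(slice_depth, pulse_width)
t_del = delay_step(target_depth, pulse_width)

print(f"exposure time: {t_exp * 1e9:.1f} ns")
print(f"delay step:    {t_del * 1e9:.1f} ns")
```

Because the delay step is shorter than the exposure time, neighbouring slices overlap, which is the condition stated in the text.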

The combination of laser radar system control and digital image
processing of the laser radar image sequences makes object recognition
and classification very efficient.

4.5.3  Binary  Image  Processing 

The result of the segmentation is a binary image. The next step is to
filter this binary image to delete single pixels. Single pixels, i.e.
pixels with the value 1 that have no other pixel with this value in
their direct neighbourhood, are simply set to 0, so that the number of
objects stays small. The next filter is a closing filter, which merges
objects that lie in the direct neighbourhood of each other. It is
probable that these objects are in fact the same object, because range
gating keeps the overlap of objects at different ranges minimal.

Different objects in range gating sequences are most often connected
only through the background, which happens if the range slice depth is
too large. In the example the range slice depth $\Delta R$ is 150 m,
which is very large, so this problem occurs. The object can only be
separated from the background if the grey values of all detected
objects are analysed. In most cases, however, this is not necessary.

The last step in separating the objects is the generation of an object
image, in which every connected area receives its own number. Two
objects are distinguished when no pixel of one object lies in the
direct neighbourhood of any pixel of the other object.
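
The single-pixel filter and the generation of the object image can be sketched as follows. This is a minimal illustration, not the code behind the figures: the "direct neighbourhood" is assumed to be the 8-neighbourhood, the closing filter is left out for brevity, and the 5x5 binary image and function names are hypothetical.

```python
from collections import deque

OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, -1),
           (0, 1), (1, -1), (1, 0), (1, 1)]

def neighbours(image, r, c):
    """Yield the coordinates of the 8-neighbours of (r, c) inside the image."""
    for dr, dc in OFFSETS:
        rr, cc = r + dr, c + dc
        if 0 <= rr < len(image) and 0 <= cc < len(image[0]):
            yield rr, cc

def remove_single_pixels(image):
    """Set every 1-pixel that has no 1-valued neighbour to 0."""
    return [[1 if value and any(image[rr][cc]
                                for rr, cc in neighbours(image, r, c)) else 0
             for c, value in enumerate(row)]
            for r, row in enumerate(image)]

def label_objects(image):
    """Give every connected area its own number (0 stays background)."""
    labels = [[0] * len(image[0]) for _ in image]
    count = 0
    for r, row in enumerate(image):
        for c, value in enumerate(row):
            if not value or labels[r][c]:
                continue
            count += 1                      # start a new object
            labels[r][c] = count
            queue = deque([(r, c)])
            while queue:                    # breadth-first flood fill
                cr, cc = queue.popleft()
                for nr, nc in neighbours(image, cr, cc):
                    if image[nr][nc] and not labels[nr][nc]:
                        labels[nr][nc] = count
                        queue.append((nr, nc))
    return labels, count

binary = [
    [1, 1, 0, 0, 0],
    [1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0],
    [0, 1, 0, 0, 1],
    [0, 0, 0, 1, 1],
]
cleaned = remove_single_pixels(binary)
object_image, n_objects = label_objects(cleaned)
```

In this toy image the two isolated pixels are deleted and two objects remain, numbered 1 and 2 in `object_image`.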

The  object  image  is  the  basis  for  the  calculation 
of  the  object  features. 

4.5.4  Object  Features 

The chosen object features for the example are

1.  Area A [m^2],

2.  Major Axis Length maA [m],

3.  Minor Axis Length miA [m],

4.  Eccentricity (<1),

5.  Major Axis Orientation,

6.  Solidity (<1),

7.  Mean of the object grey values,

8.  Standard Deviation of the object grey values,

9.  Skewness of the object grey values,

10.  Kurtosis of the object grey values.

The features can be separated into three groups:

•  features 1-3: special laser radar form features,

•  features 4-6: normal form features and

•  features 7-10: statistical moments of the grey values of each object.

The calculation of the first three features presupposes knowledge of
the laser radar system design. The real size of an object pixel

$$d_{opix} = \frac{d_{pix}}{f} \cdot R, \qquad (34)$$

with the physical width of the camera pixel $d_{pix}$, the focal length
$f$ and the object range $R$, can be used to calculate

$$A = A_{pix} \cdot d_{opix}^2, \qquad (35)$$

$$maA = maA_{pix} \cdot d_{opix}, \qquad (36)$$

$$miA = miA_{pix} \cdot d_{opix}. \qquad (37)$$

$A_{pix}$, $miA_{pix}$, $maA_{pix}$ are the pixel values corresponding
to the special laser radar form features $A$, $miA$, $maA$. For a gated
viewing camera including an MCP with a diameter $d_{MCP}$ of 18 mm and
a detector field of 768x572 pixels, the physical width of the camera
pixel

$$d_{pix} = \frac{d_{MCP}}{\sqrt{768^2 + 572^2}} \qquad (38)$$

can be calculated as 18.8 µm. It is assumed that the diameter of the
MCP corresponds to the diagonal of the CCD chip, because the MCP is
often coupled with the CCD chip through a fiber taper.

At this point it is possible to integrate another image processing
step to reduce the number of objects. Before the 10 object features
are calculated for every object in the sequence, an elimination of
small objects with a pixel area $A_{pix} < A_{min}$, with the threshold
area $A_{min}$, is useful to decrease the computation time and increase
the uniformity of the classes. In our example the threshold is fixed
to $A_{min} = 100$.
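
The conversion from pixel counts to metric sizes in Eqs. (34), (35) and (38) can be checked with a few lines of arithmetic. The sketch below reproduces the 18.8 µm pixel width from the 18 mm MCP diameter and the 768x572 detector field. The focal length of 0.5 m and the object area of 1500 pixels are hypothetical values chosen only for illustration; the range of 730 m is that of image (b) in Table 3.

```python
import math

def camera_pixel_width(mcp_diameter_m, columns, rows):
    """Pixel width if the MCP diameter equals the chip diagonal, Eq. (38)."""
    return mcp_diameter_m / math.hypot(columns, rows)

def object_pixel_size(pixel_width_m, focal_length_m, range_m):
    """Edge length of one pixel projected to the object range, Eq. (34)."""
    return pixel_width_m / focal_length_m * range_m

d_pix = camera_pixel_width(18e-3, 768, 572)   # about 18.8 micrometres

FOCAL_LENGTH = 0.5   # hypothetical focal length in m, not given in the text
RANGE = 730.0        # range of image (b) in Table 3, in m

d_opix = object_pixel_size(d_pix, FOCAL_LENGTH, RANGE)

pixel_area = 1500                     # hypothetical object area in pixels
area_m2 = pixel_area * d_opix ** 2    # metric area, Eq. (35)

print(f"camera pixel width: {d_pix * 1e6:.1f} um")
print(f"object pixel size:  {d_opix * 100:.2f} cm")
print(f"object area:        {area_m2:.2f} m^2")
```

The same object observed at twice the range would cover a quarter of the pixels but yield the same metric area, which is what makes features 1-3 range independent.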


4.5.5  Classes  and  Classification 

The  choice  of  the  classes  depends  on  the  problem 
to  solve.  For  the  example  we  define  4  classes: 

•  class  1:  hotspot, 

•  class  2:  truck, 

•  class  3:  background, 

•  class  4:  bush  (foreground). 

Working with laser radar always involves the problem of hotspots,
where the laser light is reflected strongly. In the laser radar
sequence here the hotspot is a retroreflector, but it can also be a
windscreen or a window of a house.

The classes 3 and 4 are chosen to show the problem of object
classification. The similarity of these two classes is high, so
classification errors can be expected.

The method of classification used here is a weighted distance
classifier, because the training set, i.e. the number of observations
for every class, is too small for a statistical classifier. The
training set is used to calculate, for every class $i$,
$i = 1 \ldots c$, of the given $c$ classes, the mean matrix $X_{ik}$
and the standard deviation matrix $\sigma_{ik}$ for every feature $k$
of the given $f$ features.

The object feature matrix $Y_{jk}$ for the objects $j$ includes all
object features of the $o$ objects of the whole laser radar image
sequence.

The weighted distances

ff  If  . 

dij  =  *  E  — (39) 

A:=l  Jk=l 

can be used to find the class $m$ of an object $j$ by determining the
minimum distance

$$d_{min,j} = \min_i \, d_{ij} \quad \forall\; i = 1 \ldots c. \qquad (40)$$

The object $j$ is consequently classified to the class $m$,
$m \in 1 \ldots c$, for which $d_{mj} = d_{min,j}$.
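
A minimum-distance classifier of this kind can be sketched in a few lines. The distance used below, the sum over all features of the absolute deviation from the class mean divided by the class standard deviation, is one common weighting and is an assumption here; it is not claimed to be the exact form of Eq. (39). The training data, feature choice and function names are hypothetical.

```python
from statistics import mean, stdev

def train(training_set):
    """Per class: a list of (mean, standard deviation) for every feature."""
    model = {}
    for label, observations in training_set.items():
        columns = list(zip(*observations))   # one tuple per feature
        model[label] = [(mean(col), stdev(col)) for col in columns]
    return model

def weighted_distance(features, class_stats):
    """Sum of absolute deviations, each weighted by 1 / standard deviation."""
    return sum(abs(y - m) / s for y, (m, s) in zip(features, class_stats))

def classify(features, model):
    """Return the class with the minimum weighted distance, as in Eq. (40)."""
    return min(model,
               key=lambda label: weighted_distance(features, model[label]))

# Hypothetical training observations: (area in m^2, mean grey value).
training_set = {
    "truck": [(14.0, 120.0), (16.0, 130.0), (15.0, 125.0)],
    "bush": [(3.0, 40.0), (5.0, 50.0), (4.0, 45.0)],
}
model = train(training_set)
result = classify((13.5, 118.0), model)
print(result)
```

Dividing by the standard deviation keeps features with a large numerical range, such as grey values, from dominating features with a small range, such as eccentricity.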


Figure 21: Objects 14 to 18 of the laser radar sequence in detail (x axis: size X [Pixel])

Figure 22: Image features of all 20 objects of the laser radar sequence

Figure 24: Corresponding original, object and class images of the laser radar sequence, part II

4.5.6  Results

The rows in Figure 21 show 5 objects out of the whole laser radar
image sequence and demonstrate the image processing step of object
segmentation. The first column in Figure 21 shows the binary images of
objects 14...18 of the laser radar image lki06.tif after thresholding
and filtering with the closing filter. The single binary images are
printed in square form, but the real size in pixels can be read from
the axes.

The second column in Figure 21 shows the grey values of the object
areas, with the range of the grey values given in the bar on the left
side of the image, and the third column shows only the grey values of
the objects. Pixels in column three which do not belong to the object
are set to zero. These images are true-size sections of the original
laser radar image lki06.tif.

After the segmentation and the binary image processing, 20 objects are
separated in the whole sequence. The 10 diagrams of Figure 22 show the
10 object features for all 20 objects. The object numbers on the x axis
correspond to the numbering of the objects 14...18 in Figure 21. In
most diagrams of Figure 22 there are 2 peaks, for object 4 and
object 13. These objects stand out, and it should be possible to
classify them easily. In fact object 4 is the truck and object 13 is
the hotspot.

All images of the sequence, all objects and the whole classification
result can be seen in Figure 23 and Figure 24. The first column shows
the range gated images of the whole sequence. Each of the original
images is contrast enhanced to show every detail. This means that the
noise also appears stronger if the grey value range of the individual
image is small. The second and third columns of Figure 23 and Figure 24
show the object image and the class image for each original image in
the same row. The bar next to the object images gives the object
numbers and the bar next to the class images the class of each object.
The lowest values in the bars mark the value for the background.

The classification result is relatively good; there are only 2 faults.
The bushes in the first image of the sequence are difficult to
classify, so that objects 1 and 3, which are in reality bushes, are
marked as background.

The classification of the truck and the hotspot is no problem.


5  Real-Time Considerations


5.1  List  of  symbols 


Symbol   Description
C#       number of stored constants
f        ideal classificator
FADD#    number of additions
FDIV#    number of divisions
FMUL#    number of multiplications
k        number of groups
m_d      number of discriminant functions
m_i      number of neurons in hidden layer
m_r      number of rules in fuzzy model
n        dimension of input vectors
x        input vector
y        output vector

5.2  Basic  Considerations 

In this section the usefulness of pattern recognition methods in real
time applications will be discussed. The methods discussed in section 4
of "Approaches to pattern recognitions", namely

1.  discriminant analysis

2.  feedforward neural network (one hidden layer)

3.  first order Sugeno fuzzy model

will be examined. Based on section 4 of "Approaches to pattern
recognitions", a black box function $f$ is defined as

$$y = f(x) \in \{1, 2, \ldots, k\}.$$

Function $f$ is the ideal classification operator which separates
given input vectors $x$ into groups labeled $y$. It is assumed that
the applications of the three discussed pattern recognition methods
approximate function $f$ for the same process. The three pattern
recognition methods will be compared with regard to the following two
quantities:

•  number  of  floating  point  operations 

•  number  of  stored  constants. 

The number of floating point operations is a criterion for comparing
the run time of numerical algorithms independently of the computation
platform. The comparison is split into floating point additions
(FADD), floating point multiplications (FMUL), and additionally
floating point divisions (FDIV), because the run time of an FADD, FMUL
and FDIV depends on the hardware processing system used (for instance
[24], [25]). Counting the FADD and FMUL operations is a sufficiently
good approximation; all other floating point operations are neglected.
In addition to the number of floating point operations, the memory
usage (number C of stored constants) is an indicator of the necessary
number of memory accesses. Let $k$ be the number of groups, $n$ the
dimension of the input vector $x$, $m_d$ the number of discriminant
functions, $m_i$ the number of neurons in the hidden layer of a
feedforward network, and $m_r$ the number of rules used in a first
order Sugeno fuzzy model. Then the numbers of FADD, FMUL, FDIV and
stored constants C for the three different pattern recognition methods
are given by:


method                           FMUL#
discriminant analysis            m_d (k + n)
feedforward neural network       m_i (n + k)
first order Sugeno fuzzy model   m_r (n + kn + k)  (+ m_r k FDIV#)

method                           FADD#
discriminant analysis            m_d (k + n - 1)
feedforward neural network       m_i (n - 1) + n + k (m_i - 1) + m_i
first order Sugeno fuzzy model   k m_r (n - 1) + kn + (m_r - 1) k

method                           C#
discriminant analysis            m_d (k + n)
feedforward neural network       m_i n + n + k m_i + m_i
first order Sugeno fuzzy model   2 m_r n + k m_r n

In the examples of section 4 of "Approaches to pattern recognitions"
the constants were defined as $n = 7$, $k = 4$, $m_d = 2$, $m_i = 7$,
and $m_r = 5$. The table below shows the resulting FMUL#, FADD#, and
C# for a single classification.


method                           FMUL#              FADD#   C#
discriminant analysis            22                 20      22
feedforward neural network       77                 79      91
first order Sugeno fuzzy model   195 (+20 FDIV#)    164     210

Compared to the first order Sugeno fuzzy model, the discriminant
analysis needs only about ten percent and the feedforward neural
network less than fifty percent of the run time. This result is
convincing, so we prefer the discriminant analysis for single
classifications.
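
The multiplication and storage counts can be reproduced with a short sketch. It evaluates the FMUL# and C# expressions for $n = 7$, $k = 4$, $m_d = 2$, $m_i = 7$ and $m_r = 5$ and relates each method to the Sugeno model, using the multiplication count as a rough proxy for run time. The function and variable names are illustrative only.

```python
def fmul_counts(n, k, m_d, m_i, m_r):
    """Floating point multiplications for a single classification."""
    return {
        "discriminant analysis": m_d * (k + n),
        "feedforward neural network": m_i * (n + k),
        "first order Sugeno fuzzy model": m_r * (n + k * n + k),
    }

def constant_counts(n, k, m_d, m_i, m_r):
    """Number of stored constants per method."""
    return {
        "discriminant analysis": m_d * (k + n),
        "feedforward neural network": m_i * n + n + k * m_i + m_i,
        "first order Sugeno fuzzy model": 2 * m_r * n + k * m_r * n,
    }

params = dict(n=7, k=4, m_d=2, m_i=7, m_r=5)
fmul = fmul_counts(**params)
constants = constant_counts(**params)
fdiv_sugeno = params["m_r"] * params["k"]   # divisions of the Sugeno model

reference = fmul["first order Sugeno fuzzy model"]
for method, count in fmul.items():
    share = 100 * count / reference
    print(f"{method}: {count} FMUL ({share:.0f} % of Sugeno), "
          f"{constants[method]} constants")
```

The shares come out at roughly 11 % for the discriminant analysis and 39 % for the neural network, in line with the "about ten percent" and "less than fifty percent" quoted above.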

5.3  Practical  Considerations 

For practical considerations the computation times of different
classification algorithms are compared. The reference computation
system was a Pentium Pro 200 MHz Windows NT PC workstation with
256 MB RAM. The classification algorithms were built using the MATLAB
computation system and the MATLAB Compiler version 5.2.

Table 4 shows the computation time of 20 classifications using the
example of section 4 in "Approaches to pattern recognitions".


method                           run time (ms)
discriminant analysis            0.8
feedforward neural network       2.5
first order Sugeno fuzzy model   3.9

Table 4: Run time


Compared to the first order Sugeno fuzzy model, the discriminant
analysis needs only about twenty percent and the feedforward neural
network less than sixty-six percent of the run time.

Additionally, a PCI board with four neurochips was used to perform the
classification task. This PCI board achieves a peak performance of
800 MCPS (mega connections per second). Running feedforward neural
network based classifications on the neurochip hardware gives a
computation time of 0.28 ms per 20 classifications.




6  Conclusions 

Practical applications of pattern recognition are possible with very
different approaches and with numerous different feature variables.
However, care must be taken that the hardware of the pattern
recognition system is designed to meet the needs of the classification
approach.

Simple statistical descriptors and more or less complicated
statistical classifiers should be chosen for first experiments. It
often turns out that they provide both superior overall performance
with respect to correctness of classification and minimal
computational requirements (time, storage capacity).